PROCESS FOR EXCHANGING INFOR3y[ATION 
IN A MULTIPROCESSOR SYSTEM 
The invention relates to a multiprocessor system of the type 
comprising a central memory, treatment processors and cache 
memories associated with treatment processors. It also relates 
to a process for the exchange of information between central 
memory and treatment processors via the cache memory associated 
with each of these processors. It also provides a new integrated 
circuit component, capable of equipping the multiprocessor 
system. 

It is known that, in the most common known multiprocessor 
systems, all the information (data, address) is relayed by a 
common parallel communication bus between the central memory and 
the various treatment processors, which constitutes a bottleneck: 
its transfer rate is in effect insufficient to .feed all the 
processors for full efficiency, from a common central memory. 

For increasing the information transfer rate, a first 
solution consists in associating with each treatment processor 
a cache memory which, by the location of the information, permits 
reducing the demands on the central memory. However, in the case 
in which the volume of data shared between processors is 
substantial, the maintenance of coherence of the data between 
memories generates complementary information traffic on the 
communication bus which resists a significant reduction of the 
overall flow on this bus, and therefor removes a large part of 
the interest in this solution. 

Another solution consists in providing the communication bus 
in the form of a grid network designed as a "crossbar", which 
permits a direct communication between each treatment processor 
and each subassembly of the Qentral memory (memory bank) . 
However, this solution is very heavy and very costly to achieve 
because of the very great number of interconnections, and it 
becomes completely unrealistic beyond about ten treatment 
processors. Moreover, in the case of multiple' demands of several 
processors on the same memory bank, such a solution implies 
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access conflicts, a source of slowing up the exchanges. 

Another more current solution by reason it its architectural 
simplicity consists in associating a local memory with each 
treatment processor for storing specific data therein, and 
storing the transferred data in the common . central memory. 
However, the great deficiency of this architecture is its non- 
transparency, that is, the need for the programmer to organize 
the detail of the allocation of data in the various memories, 
such that this solution is of a very constrained usefulness. 
Moreover, in the case of high volume of transferred data, it may 
lead as' before. to a saturation of the access bus in the central 
memory. 

A solution which has been called "aquarius architecture" has 
been proposed by the University of Berkeley and consists in 
improving the aforementioned crossbar solution by combining with 
the crossbar network, for the shared data, cache memories which 
are connected to the crossbar network, and for the shared data, 
distinct cache memories which are connected to a common 
synchronization bus. This solution contributes a gain in speed 
of exchange but remains very heavy and very costly to achieve. 

The present invention seeks to provide a new solution, 
permitting considerably increasing the flow rate of information 
exchange, while retaining an architecture which is transparent 
for the user, much simpler than the crossbar architecture. 

An object of the invention is thus to permit notably 
increasing the number of treatment processors of the system, 
while benefitting from a high efficiency for each processor. 

Another object is to provide a structure of an integrated 
circuit component, permitting a very simple realization of the 
architecture of titis new multiprocessor system. 

To this end, the multiprocessor system provided by the 
invention is of the type comprising a central memory (RAM) 
organized in blocks of information (bi) , treatment processors 
(CPUi. . .CPUj. . .CPU„) , a cache memory (MCj) connected to each 
treatment processor ( CPU, . . . CPUj . . . CPU„ ) a cache memory (MCj ) 
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connected to each treatment processor (CPup and organized in 
blocks of information (b.) of the same size as those of the 
central memory, a directory (Rcp and its management processor 
(PGP associated with each cache memory (MCp , means for 
communication of addresses of blocks between processors (CPU.) 
and a central memory (RAM) . According to the present invention, 
the multiprocessor system is provided with: 

an assembly of shift registers, termed memory shift 
registers (RDMl. . .RDI^. . .ROMn) , each register (RDlL) of this 
assembly being connected to the central memory (RAM) "in such a 
manner as to permit, in one cycle' of this memory, a parallel 
transfer in read or write of a block of information (bj) between 
said register and said central memory; 

shift registers, termed processor shift registers 
(RDPl...RDP,...RDPn), each processor shift register (RDP.) being 
connected to the cache memory (Mcp of a processor (CPU.) in such 
a manner as to permit a parallel transfer in reading or writing 
of a block of information (b,) between said shift register (RDR) 
and said cache memory (MCj) ; • 

an assembly of series links (LS,. . .LSj. . .lsj , each 
connecting a memory shift register (RDI^) and a processor shift 
register (RDPp and adapted to permit the transfer of blocks of 
information (b,) between the two registers considered (RDM., 
RDPj) . ^ 

Thus, in the multiprocessor system according to the 
invention, the exchanges between the cache memories and the 
associated processors are carried out as in the conventional 
systems provided with cache memories. By contrast, the exchanges 
between the central memory and the cache memories is carried out 
in an entirely original manner. 

Each transfer of an information block (b,) from the central 
memory (RAM) to the cache memory (MC^ of a given processor 
(CPUj) consists of: 

transferring, in a cycle of the central memory, the block 
(bj) of said central memory (RAM) to the memory shift registet ' 



(RDl^) (of the size of one block) which is directly connected to. 
the central memory and which corresponds to the processor (CPU,) 
considered, 

transferring on the corresponding series link (LSj) the 
contents of this memory shift register (RDl^.) to the processor 
shift, register (RDP,) (of the same capacity) which is associated 
with the cache memory (MCp of the processor considered (CPUj) , 

transferring the contents of said processor shift register 
(RDPj) to the cashe memory (MC,) . 

In the opposite direction, each transfer of information 
blocks (b,) from the cache, memory .(Mc,) of . a given processor 
(CPDj) to the central memory (RAM) consists of: 

transferring the block (b,) of said cache memory considered 
(MC,) to the processor shift register (RDPj) which is associated 
with said cache memory (MC,) , 

transferring on the corresponding series link (LSj) the 
contents of the processor shift register (RDP,) to the memory- 
shift register (RDM,) , allocated to the processor considered 
(among the assembly of shift registers (RDMl. . .RDI^, . .RDMh) 
connected to the central memory (RAM) , 

transferring in a cycle of the central memory, the contents 
of the memory shift register (RDf^) to said central memory (RAM) . 

In these conditions, the transfer of each block of 
information (b,) is carried out, no longer through a parallel 
bus as is the case in the known systems, but by the seriesl links 
of high flow rate. These series links permit obtaining 
comparable times of transfer for each block (bj) and even lower 
than the transfer times in known parallel bus systems. The 
comparative example given hereinbelow with the current values 
of the parameter for current technology, illustrates clearly this 
fact which seems paradoxical. 

It is assumed that each block of information (bi) is of a 
size equal to 64 octets. 

In the system of the inv ntion, the transfer time between 
the central memory and a cache memory breaks down into: 



4 



a central memory transfer time (RAM) /memory shift register 
(RDl^) : 100 nanoseconds (performance of a central random access 
memory of known type) , 

a series transfer time on the corresponding series link: 
64 X S X 1/500.10*, either 1024 nanoseconds, assuming a transfer 
frequency of 500 megahertz (not exceptional with current 
technology which permits attaining frequencies of 3000 
megahertz) , • 

a processor shift register transfer time (RDP,) /cache memory 
(MCj) : 50 nanoseconds (cache memory of the very current type) . 

The total time of transfer of a block is. therefor on the 
order of 1200 nanoseconds (while integrating the chaining delays 
of the second order) . 

In known systems with cache memories in which the exchanges 
of information is carried out directly in parallel by words of 
4. octets (the most current systems leading to busses of the 
conventional type of 32 data lines), the transfer time for one 
block is equal to the transfer time of 16 words of 4 octets which 
comprise this block, that is: is x 100 = ISOO nanoseconds. 

Thus, it is seen that, with the average hypotheses in the 
two solutions, these times are comparable. But, if one compares 
the architecture of the system according to the invention with 
that of a parallel bus common with cache memories (first solution 
mentioned previously) , it will be realized that: 

in the conventional solution (common parallel bus), the 
central memory and the; common bus are occupied at 100% during 
the transfer, since the information circulates between the two 
for the entire transfer time, 

in the system according to the invention, the series link 
is occupied 100% during the transfer, but the central memory is 
occupied less that 10% of the transfer time (time of memory 
reading and loading of the memory shift register (RDM,)), such 
that the central memory may serve 10 times more processors than 
in the preceding case (the use of the' series link being without 
significance since it is private and directed to the processor) . 
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It IS inportant to emphasize that in this system of the 
invention, each series connection which connects each processor 
m ,a„ xnaivxaual nanner to the central .neBory is a simple 
connection (of one or two data leads), such that the series 
network, thus constituted is not comparable in the overall plan 
*o the comple:.lty with, for example, a crossbar network of which 
each connection is a parallel connection with multiplicity of 
leads (32 leads of data in the comparative Example above) , with 
ail Of the necessary switches. 

Further, as will be seen below on the comparative curves, 
the system according to the invention has greatly improved 
performance with respect to the traditional common bus systems 
and permits in practice operating a much higher number of 
processors (of several tens to a hundred processors) . This 
perform«,c. is compatible with that of a crossbar system, but 
the sy.t«» according to the invention is of a much greater 
architectural sixaplicity. - 

in the system of the inv«,tion, each series link may in 
practice be achiev«J either by means of two unidirectional seriii 
links for bit by bit transfer, or by means of a single 
txidirectional series link. 

In the first case, each memory shift register; (RDM,) and 
each processor shift register (rdp,) are divided into two 
registers, one specialized for the transfer in one direction, 
the other for the transfer in the other direction. The two 
unidirectional series links are then connected to the divided 
memory shift register (RDM,) and to the corresponding divided 
processor shift register (RDP,) , in such a manner as to permit, 
for one, a transfer in one direction, and for the other, a 
transfer in the other direction. 

■ This embodiment with two unidirectional 'links presents the 
advantage of not requiring any transfer management on the link, 
but the inconvenience of doubling the necessary resources (link, 
registers). / ' 

in the second case, a validation logic of the transfer " 



direction is associated with the bidirectional link such as to 
permit an alternate transfer in the two directions on said link. 
This logic may be integrated in the management processor (PGj) 
associated with the cache memory (MCj) to which said 
bidirectional link is connected. 

• It will be understood that each series link may ultimately 
be provided with a higher number of series links. 

In the multiprocessor system according to the invention, 
the address communication means may cover essentially two forms 
embodiments: in the first case, it may consist of a "parallel 
address communication bus for blocks (BUSA) , conaaon .to all of 
theprocessors (CPU,) and connecting the latter and the central 
memory (RAM) in a conventional manner with an arbitrator bus (AB) 
adapted to manage access conflicts on said bus. It is necessary 
to note that this address bus is only utilized for communication 
of addresses of blocks: in the plan of the structure, this bus 
is identical to the parallel address communication bus of known 
systems, for which no problems of saturation are interposed, 
since it will be freed right after transfer of the address block. 

However, another embodiment of this address communication 
means may be considered in the multiprocessor system of the 
invention, consisting in operating the series links for transfer 
of blocks of information (b,) to transfer the addresses of these 
blocks . 

In this case, a complementary shift register (RDCj) is 
connected to . each series link (LSj) in parallel with the 
corresponding memory shift register (RDC,) . " The addresses 
transmitted by said series link are thus loaded into each of 
these complementary registers (RDCj) . An access management 
arbitrator connected to said registers (RDCj) and .to the central 
memory (RAM) is thus provided for selecting the addresses 
contained in said registers and for managing the conflicts of 
access to the central memory (RAM) . Such an arbitrator is 
conceivably known it itself, this type of access conflicts being 
now resolved for a number of years. In this embodiment, the 
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presence of a parallel conmunication address bus is avoided, but 
the management resources are made more heavy. 

Further, the multiprocessor system according to the 
invention is particularly well suited for managing in an 
efficient manner the problems of coherence of the data shared 
between treatment processors. In effect, the conventional 
solutions for managing these shared data find their limits in 
the known systems from the fact of the bottleneck at the level 
of the communication of information, but become, on the contrary, 
perfectly satisfactory and efficient in the system of the 
invention where such a bottleneck no longer exists, such that 
this system may be equipped with shared data management means of 
an analogous concept to that of known systems. 

For example, one traditional solution of shared data 
management consists in avoiding the relay of shared data by the 
cache memories: in a conventional mauiner, a partition logic 
(LPj) is associated with each treatment processor (CPU,) in order 
to differentiate the addresses of the shared data and those of 
the non-shared data so as to direct the first directly toward the 
central memory (RAM) and the second toward the corresponding 
cache memory (MC,) • 

In a first version of the architecture according to the 
invention, the system comprises: 

a special bus for parallel communication of words (BUSD)- 
connecting the processors (CPUj) and the central memory (RAM) , 

a partition, logic (UP,) associated with each processor 
(CPU,) and adapted to differentiate the addresses of the shared 
data and those of the non-shared data in such a manner as to 
transmit the non-shared data on the address communication means 
with their identification, 

a decoding logic (DEC) associated with the central memory 
(RAM) and adapted to receive the addresses with their 
identification and to direct the data into the memory output 
- either to the corresponding memory shift register (RDM,) for the 
non-shared data, or to the special word commtinication bus (BUSD) 

8 



for, the shared data. 

This solution presents the advantage of being very simple 
in the architectural plan. The presence of the special parallel 
communication bus (BUSD) leads to better performances with 
respect to a solution which consists in utilizing the series, 
connections for transferring not only the blocks of non-shared 
data but also the words of shared data. It should be noted that 
this latter solution may, in some cases, be provided in case of 
low flow of shared data. 

In another version, the system is provided with a special 
bus for parallel communication of words and a special common bus 
for communication of addresses of words (BUSAM) in order to 
transfer the data by the special Word bus (BUSD) , and direct the 
non-shared data to the address communication means (which may 
coipaprise a parallel communication bus where the communication is 
carried out by the series links) . 

' The presence of a special bus for communication of addresses 
of words permits, in this version, to move back the saturation 
limit of the address communication means, in case of high demand 
for shared data. 

Another version which will be preferred in practice in the 
case in which the address communication means comprises a 
parallel address commxinication bus (BUSA) consists in providing 
the system with a memory management processor (PGM) associated 
with the memory (RAM) and a snooper processor with a bus (PEj) 
associated with each treatment processor (CPUj) and to the 
corresponding management directory (RGj) . The memory management 
processor (PGM) and each (espion) processor, of structures known 
in themselves , are connected to the address communication bus 
(BUSA) in order respectively to oversee and to treat the 
addresses of blocks transmitted on said bus in such a manner as 
to permit an updating of the central memory (RAM) and of the 
associated cache memory (MCj) in case of detection of an address 
of a block present in the associated directory (RGj) . 

The memory management processor (PGM) and each snooper 



processor (PEj) associate status bits of each block of 
inf oirmation ^ holding them op n as a function of the nature (read 
or write) of the requirements of the block which transitions on 
the bus (BUSA) and assures the coherence of the shared data while 
using these status bits which permit them to force or not write 
a block into the central memory at the moment of the requests on 
the bus (BUSA) . 

In the case referred to previously where the communications 
of addresses are made by the series connections, the management 
of shared data may also be assured in a centralized manner, by 
a memory management processor (PGM) associated with the central 
memory (RAM) and a processor for maintaining the coherence of the 
shared data (PMCj) associated with" each treatment processor 
(CPUj) and with the corresponding management directory (RGj) , each 
coherence maintenance processor being connected to a 
synchronization bus (SYNCHRO) controlled by the memory management 
processor (PGM), in such a manner as to permit an updating of 
the central memory (RAM) and of the associated cache memories 
(MCj) in' case of detection of an address block, an updating of 
the central memory (RAM) and the cache memories (MC,) at each 
address selection in the complementary shift registers (RDCj) . 

As before this operation is assured due to the status bits 
associated with each block of information by the processor (PGM) • 

It should be noted that a synchronization bus of the type 
hereinafter defined may, in some cases, be provided in the 
preceding architecture where the address of blocks move on a 
common address bus BUSA. In this case, the snoopea? processors 
(PEj) are urged by the memory management processor (PGM) via the 
synchronization bus, and this only when they are concerned by the 
transfer. Thus, non-useful access' to the cache memories is - 
avoided. , The snooper processors become then passive (since 
driven by the processor PGM) and they are designated more by the 
more appropriate expression "coherence maintenance processor" 
according, to the terminology hereinabove utilized. 

Another isolution consists in reserving the parallel address 
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communication bus (BUSA) for the transfer of addresses of blocks 
of shared data and using the series links for the transfer of 
blocks of non-shared data. 

Further, the multiprocessor system according to the 
invention lends itself to the regroupings of treatment processors 
on a same series link, in such a manner as to limit the series 
links and the corresponding memory shift registers (RDI^) 
necessary. 

The number of memory shift registers (RDI^) may correspond 
to the number of series links (LSp , to which case each memory . 
shift register (RDM,) is connected in a static manner to a series 
link (LSj) specifically appropriated to said register. 

The number of memory shift registers (rDm.) may also be 
different from that of the series connections (LS^) and in 
particular less,, in which case these registers are connected in 
a dynamic manner to the series links (Lsp through an 
interconnection network. 

As in conventional systems, the central memory (RAM) may 
be divided into 'm' memory banks (RAMj. . .RAMp. . .RAM„) arranged in ' 
parallel. Each memory shift register (RDM,) is then comprised 
of m elementary registers (RDl^l. . .RDM,p. . .RDM^m) connected in 
parallel to the corresponding series link (LSp . However, a 
level of supplementary parallelism and a better electrical or 
optical adaptation of the connection is obtained in a variation 
in which each memory bank RAM^ is connected to each processor 
CPUj by a series link from point to point LSjp. 

In order to provide transfer performance at least equal to 
those of conventional systems with a parallel bus, the system 
according to the invention is preferably synchronized by a clock 
of a frequency F at least equal to 100 "megahertz. The memory 
shift registers (RDM,) and processor shift registers (RDP,) may 
very simply be of a type adapted to present a shift frequency at 
least equal to F. 

In the case of- very high frequencies (particularly greater 
than 500 megahertz with current technology) , the registers may 
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be divided into sub-registers of a lower shift frequency, and 
then multiplexed. 

The invention also relates to a multiport series memory 
component, susceptible of equipping the multiprocessor system 
previously described, in .order to simplify the fabrication. 
This component, which may have different applications, is 
constituted by an integrated circuit comprising a random access 
memory (RAM) of a pre-determined width corresponding to a block 
of information (bi) , an assembly of shift registers 
(RDMi — RDM,...RDM„) , each of a capacity corresponding to the size 
of. the memory, an internal parallel bus (BUSI) connecting the 
access of the memory and the shift registers, a selection logic 
of a shift tegister (LSR) adapted to validate the connection on 
the. internal bus between the memory and a predetermined shift 
register, and an assembly of external input/ output pins for the 
input of addresses to the memory (RAM) , for the input of 
addresses to the selection program (LSR), for the input and the 
validation of transfer commands in read or write of a block of 
information (bi) between the memory (RAM) and the shift registers 
(RDMj) , for the input of a clock signal to each shift register 
(ROMj) for the input bit by bit of a block of information (bi) 
to each shift register (RDM^) and for the output bit by bit of a 
block of information from each shift register (RDM^) . 

This component may be made parametrable by the adjunction 
of configuration registers (Rq, RCj, . . . ) permitting particularly 
a choice of sizes of blocks of informat.ion (bj) and of diverse 
modes of operation of the shift registers. 

The invention having been described in its general form, 
is illustrated by the description which follows in reference to 
the accompanying drawings which show without limitation several, 
embodiments. In these drawings, which form an integral part of 
the present description: 

FIGURE 1 is a block diagram of a first embodiment of the 
multiprocessor system- according to the invention; 

FIGURE 2* is a diagram giving the calculated performance 
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curve of this system (A) and, by way of comparison, the 
corresponding curve (B) for a conventional multiprocessor 
architecture with a common bus; 

FIGURES 3, 4 and 5 are detailed logic schematics of 
functional units of the system of FIGURE 1; 

FIGURE 6 is a block diagram of another embodiment of the 
system; 

FIGURE 7 is a block diagreim of a system of the type as that 
of FIGURE 1, provided with shared data management means; 

FIGURE 8 is a detailed logic diagram of a sub-assembly of 
the system of FIGURE 7; 

FIGURE 9 is a block diagram of a system analogous to that 
of FIGURE 7 with a variation in the shared data management means; 

FIGURE 10 is a block diagram of an analogous system, 
provided with a different .shared data management means; 

FIGURES 11/ 12a, 12b, 12c, 12d, 13, 14, 15, 16 17 are 
detailed logic- schematics of the functional units of the 
processor system of FIGURE 10; 

FIGURE 18 is a block diagram of a system of the type as 
that of FIGURE 6, provided with means for shared data management; 

FIGURE 19. is a simplified block diagram of a variation of 
the system, in which several central units share the same series 
link; 

FIGURE 20a is a block diagram of a preferred embodiment, 
in which the central memory is organized into several memory 
banks, 

FIGURE 20b is a variation of the architecture shown in 
FIGURE 2 OA; 

FIGURES 21a and 21b schematically show another structure 
of the memory RAM susceptible of equipping said system; "and 

FIGURE 22 is a block diagram showing the structure of a 
multiport series memory component, capeible of equipping the 
system. 

• The apparatus presented -in the form of a block diagram in 
FIGURE 1 is a multiprocessor system having n treatment processors 
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CpUi...CPUj...CPU„. Shown in this figure are two treatment 
processors CPUl and CPU, with their associated logic. Each of 
these treatment processors is of a conventional type, for example 
"Motorola 68020" or "Intel 80386"... and with peripheral 
interfaces and heing provided with a virtual memory device. 

The apparatus comprises a central random access memory RAM 
provided in a conventional manner from integrated memory 
circuits: in particular dynamic RAM "INTEL," "NEC," 

"TOSHIBA"... of 256 Kbits, 1 Mbits, 4 Mbits. . . depending upon the 
application. This memory is organized in blocks of information 
b,. . . b,. . . of predetermined size t (usually 256 bits per 2 Kbits) 
and the access front of said memory corresponds to the size of 
ohe block. 

The. central memory is connected in parallel to n shift 
registers RDMi... RDM,... RDM„ called memory registers, each 
memory register having the size t of one block of information. 
Each of these registers is provided from high speed technology 
("ASGA") , one latch able to be loaded or unloaded in one cycle 
of the central memory RAM. The number n of registers is equal 
to the number of processors CPUj. 

A cache mempry MC, is associated in a known manner with each 
processor CPU,. Each cache memory comprises a conventional high 
speed random access memory, of low capacity with respect to the 
central memory RAM. A directory RGj and a management processor 
PGj are, in a traditional manner, connected to the cache memory 
and to the treatment processor for managing th? information 
flowing in the cache memory. 

Further, in the system of the invention, a shift register 
RDPj- called a processor register is connected by its parallel 
port to each cache memory MC,. Each processor register RDPj is 
of a size corresponding to that of a block bi and of a structure 
similar to that of the memory registers RDM,. 

Each memory register RDM, is connected by its series port 
to the series port of a processor register RDPj by a series link 
LSj. Examples of the provision of this series link, which may 
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comprise a bidirectional link or two unidirectional links, are 
illustrated in FIGURES 4 and 5. The control of the transfer of 
the blocks b, between corresponding registers RDM, and RDP, is 
assured by the transfer logic TFR, and TFR' which are associated 
in a symmetrical mannei; with the memory register RDJ^ and with 
the processor register RDPj. One embodiment of these transfer 
logics (known in themselves) is detailed in figure 3. 

The central memory ram, memory shift registers RDMi...RDM„, 
and associated transfer logics TFRi...tfr„ comprise a functional 
assembly called a "multiport series memory" MMS. The treatment 
processor assembly CPU,, cache memory MC,, management directory 
for the cache RG,, management processor of the cache pg,, shift 
register processor RDP, and the associated transfer logic TFR'. 
constitute a "functional" assembly called a "central unit" UC,. 

Further, the system comprises means for communication of 
addresses of blocks of the processors CPUj to the central memory 
RAM, comprising in the example a common parallel communication 
bus BUSA pn which are connected the processors CPU, (though their 
management -processor PQj) and the central memory RAM. 

Access to the bus BUSA is controlled in a conventional 
manner by a bus arbitrator AB. 

The general operation of the architecture hereinabove 
defined is as follows: 

A processor CPUj carries out a program proper comprising 
instructions, which are found in the form of words in the central 
memory RAM with extracts in the associated cache memory MCj. On 
these instructions of the program, the processor CPU, is brought 
either to read the words of data which themselves are found in 
the central memory RAM or in the cache memory MCj in the form of 
. extracts, or to write the words of data in the central memory RAM 
and in the cache memory MCj. 

Each operation of a processor CPUj called a "request") 
requires the furnishing of the address adr of the word concerned, 
the nature- r,w of the operation (read, write), and the data 
exchange of the word coiicerned. 
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Each request of a word activates the processor PG, which 
then consults in a conventional manner the directory of the cache 
RGj which indicates if the block bi containing the word concerned 
is present in the cache memory MC, at, in some cases, the frame 
of the block in the cache memory where ,the block searched for is 
found. 

If the block bi containing the word concerned is in the 
cache memory MC,, then in case of reading, this word is read in 
said cache memory and sent to the processor CPU-. in case of 
writing, the word furnished by the processor CPUj is written in 
the cache memory: the memory transaction is terminated. 

If the block containing the word concerned is not in the 
cache melaory MC,, then a reading of the block bi into the central 
memory BAM is necessary. Two cases may be produced, 
first C^se 

The cache memory MCj makes use of at least one free block, 
determined by the proceissor PG, with the help of status bits 
associated with each input of the directory RGj. . In this case, 
the processor - PGj is requesting, in a conventional manner, the 
bus BUSA while referring its demand to the bus arbitrator AB. 
This latter reconciles, in turn, the bus BUSA to the processor 
PGj which agrees while writing to the central memoiry RAM, the 
block read in the memory being loaded into the register RDMj, 
identified by the origin j of the call. The end of the reading 
cycle translates by the liberation of the bus BUSA and the 
activation of the transfer with the series link LSj permitting 
the transfer of the contents of the memory register RDMj in the 
processor register RDPj. The end of the transfer activates the 
writing in the cache memory MC, of the contents of the processor 
register in the placement of the block reserved for this effect- 
and the transaction may terminate as before. 
Second case 

The cache memory MCj is- not arranged for free placement 
then, by a conventional algorithm, a placement of the cache is 
made a candidate for receiving the block requested. Two 
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situations may be encoiintered: 

The block contained in the candidate position has not been 
modified since its installation: it is simply eliminated while 
freeing the frame of the block by a simple writing of a status 
bit in the directory (RGj) and the transaction may proceed as 
before* 

The block contained iri' the. candidate position has been 
modified and an operation of the central memory is required. 
For this fact^ the management processor PGj transfers the 
candidate block to the processor register RDP,, activates the 
transfer of the processor register RDPj to the memory register 
RDM,, then requests the common bus BUSA while referring its 
request to the arbitrator AB. When the arbitrator admits the bus 
to the management processor PGj, the latter activates a write 
command which has the effect of transferring the contents of the 
memory register RDMj to its position in the central memory RAM. 
The operation of the memory RAM is terminated and the transaction 
may continue as before. 

Thus, in the apparatus of the invention, the exchanges 
between the treatment processors CPUj and their cache memory MCj 
and associated logics RGj, PGj, are carried out in a conventional 
manner. By contrast, the transfers of blocks between central 
memory RAM and cache memories MC, proceed, no longer by a common 
parallel bus, but by the series links LSj dedicated to each 
treatment processor CPUj, the common bus BUSA only serving for 
the. transfer of the addresses and having thus a considerably 
reduced traffic. 

It is known that, for conventional common bus architectures, 
a modelization studied by "PATEL" ("Analysis of Multiprocessors 
With Private Cache") JANAK H. PATEL, IEEE Transactions on 
-Computers,. Vol C.31, No. 4, April, 1982) has led to the following 
approximate formula giving the efficiency U as a function of the 
nuinber of processors present: 

1 

u 

1 + m (W+tf ) 
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where the efficiency U is the average rate of utilization of 
each treatment processor, 

m is the probability for a treatment processor to make a 
memory request, not present in its the cache memory (this 
probability in = .Pa is proportional to the. probability of the 
absence Pa of the information in the cache memory and a factor 6C 
which is a function of the power of the treatment processor 
expressed as a percentage of the memory requests) : 

W is the average waiting time of the common bus, which is 
a function of the number of processors, 

*tf is the transfer time of a block from the central memory 
to a cache memory. 

The hypotheses from which this formula has been established 
show that it is applicable to the architecture according to the 
invention, with a level of approximation comparable tp the level 
of approximation of the formula for conventional common bus 
architecture. 

It is thus possible to compare the performances of the 
two types of architecture while assuming that the components 
common to the two architectures are of identical characteris- 
tics. 

FIGURE 2 gives the curves obtained of the efficiency U as 
a function of the number n of processors for the following 
parameters, the parameters common to the two systems being 
identical, and all of a usual value: 

size of block bj 64 octets, 

size of word for the parallel transfer on common bus = 4 
octets , 

central memory RAM access time = 100 nanoseconds, 
- cycl6 time of the bus BUSA = 50 nanoseconds, 
series transfer frequency « 500 Mhz, 

probability of absence Pa = 0.005 (cache memory of 16 
octets) , 

power factor of the processors: = 0.5, 

It is established by comparing the curves A (architecture 
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of the invention) and B (conventional architecture) that the 
architecture according to the invention has an efficiency 
clearly superior to the conventional architecture. The ar- 
chitecture of the invention permits putting in place a number 
of processors very superior to the conventional conmion bus ar- 
chitecturo which, in practice/ cannot exceed about ten proces- 
sors. For example, in the conventional case, an efficiency of 
0.75 is obtained at the tenth processor, whereas it is 
obtained for more than 80 processors in the case of the 
invention. 

FIGURE 3. presents an embodiment of a transfer logic TFR, 
or TFR' J permitting the transfer of a block bi of information 
from a memory register RDMj to a processor register RDPj (the 
inverse transfer is assured by the symmetric means not shown 
in this figure). Each logic TFR, or TFR'j comprises a part for 
control of the output TFREj and TFRE'j, and a part. for control 
of the reception TFRRj and TFRR'^ which are activated in a 
crossed manner (output TFREj activated in synchronism with the 
reception TFRR'j). The system comprises a clock generator H 
the frequency of which fixes the speed of transmission and 
furnishes the clock signal h at the output part TFREj and at 
the reception part TFRR'j. 

In the output part TFREj a countdown register DC receiv- 
ing at its load input loadj the reading signal r from the 
management processor PG, permits allowing to pass t + i pulses 
of the clock h through a logic port ETl controlled by a pass 
to zero signal "borrow" , the output of this port ETl being 
connected to the input of the "down" counting of the countdown 
device DC and to the shift input shiftl of the memory register 
RDM,. .... 

In the reception part TFRR'j, a flip flop b is connected 
by its data input D to the series output of the processor 
register RDPj, the clock input elk of this flip flop being 
connected to the clock *H for receiving the signal h. An 
initialization signal "Uiit" furnished by the management 
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processor PGj is connected to the input s of the flip flop b 
and to the loading input lS5d3 of the processor register RDR. 
The_output Q of the flip flop transmits a control signal ^ 
transfer to the logic port ET2, permitting the clock signal h 
to pass toward the shift input shift2 of the processor 
register RDPj. This control signal is also delivered to the 
management processor PGj for indicating the end of transfer of 
the block. 

The operation of the assembly is as follows: the manage- 
ment processor PG,, after having obtained access to the central 
memory RAM via the bus busa, carried out its reading of the 
block bi While furnishing the address of the block concerned 
and the reading signal r. This signal releases the activation 
Of the output part TFRE^:. the final front of the reading 
signal r provokes the loading of the block bi into the memory 
register RDM, while activating the signal l^i and the loading 
Of the value t + i, corresponding to the size in bits of the 
block bi plus one supplementary bit called "start" , into the 
countdown register DC by the sig nal lb iS2. This has for 
effect to reset to l the signal bS^Fo^ and authorize the 
transfer clock H to furnish, through the logic port ETl 
conditioned by this borrow signal, t + i clock pulses h: 
these pulses have for effect to shift by the input shiftl t + 
1 bits from the memory register RD»^ and to cause to wait for 
the value 0 from the down input to the countdown device DC: 
the signal borrow is reset to ^ero and locks the operation of 
the output part TFREj. 

Thus, the series connection LSj, initially having the 
logxc rest state 1, transfers the start bit o, then the t bits 
Of the blocJcbi,- and then returns to the logic rest state 1 
the latter bit sent being the value l forced on the series 
input of the memory register RDl^. 

AS a preliminary to the reading demand, the management 
processor PGj has initialized the receiving part TFRR'. while 
activating the init signal which has the effect of loading the 
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proce ssor register RDP, with t bits to 1 through the input 
load3 and to set the output Q of the flip flop B to the logic 
state 1 by the input s. This, output Q then validates the 
logic port ET2 which allows the clock signal h to pass toward 
the input shift2 of the processor register RDPl. At each, 
clock pulse this processor register furnishes a bit on its 
series output which is stored in the flip flop B. The first 
bit 0 which is presented has the effect of setting to zero the 
output Q of the flip flop B and locking the clock signal h on 
the gate ET2. This first bit 0 being the start bit "which 
precedes the "block hi, this latter is then trapped in the 
processor register RDP, when the management processor PG, is 
notified of the change of state of the flip flop B by the end^ 
transfer signal: the management processor PG, need only come 
to read this block bi oh the parallel output of the register 
RbPj. 

The writing of a block bi to the central memory RAM 
requires the presence of a logic TFRE'j, identical to the logic 
TFREj, associated with the processor register RDP,/ and a logic 
TFRR,, identical to the logic TPRR'j, associated with the 
memory register RDl^. in this case, the signal init of the 
logic TFRR, is connected to the writing signal w: the 
liberation <Sf the memory register RDM, automatically rearms the 
receive logic TFRR,. 

This embodiment of the transfer control logic is only one 
possible example: the transmitter register may be in per- 
manent shifting itself, and the receiver register activated 
for t pulses of the clock oh detection of the start bit at the 
beginning of transfer. 

The clock H may be -connected to two registers, or two 
local independent clocks may be used, the synchronization 
being obtained in a conventional manner by an introduction of 
synchronization. 

- The system shown in FIGURE 4 comprises a divided memory 
shift register RDI^ and ROMj, a divided processor shift 
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register RDPlj and RDP2j, two unidirectional series links LSij 
and LS2j, one connecting the memory register RDMl to the 
processor register RDPj in such a manner as to transmit the 
contents of the first to the second, the other connecting the 
memory register RDH2j to the processor register RDP2, in, such 
a . manner as to transmit the contents of the second to the 
first, and the associated logics for the control of the trans- 
fer: TFRElj for RDMl; TFRR2j for RDM2; TFRE2j for RDPgj; TFRRlj 
for RDPlj. 

For reading a block of information bi into the central 
memory RAM, the ' management processor PG, initializes by the 
signal init the logic TFRRlj associated with the processor 
register RDPjl, then activa1:es its demand to read to the memory 
RAM by the read signal r. This signal, activates the logic 
TFRElj associated with the memory register RDMlj: this assures 
the transfer on the link liSlj of the block bi of information. 
The end of the transfer is detected by the logic TFRRlj 
associated with the jprocessor register RDPlj which notifies 
the management processor PG, of the arrival of the block bi by 
the signal end-Jbransfer. The management processor; PGj thus 
transfers the contents of the processor register RDPlj into 
the cache memoary MCj. 

. For writing a memory block bi, the management processor 
PGj loads the processor register RDP2 with the block b; con- 
cerned extracted from the cache memory MCj, which activates 
the transfer of this block on the link LS2j. The transfer 
logic TFRR2j associated with the memory register RDM2j assures 
the good reception of this block. The management processor 
PGj is notified of the end of transfer by the change of state 
of the signal borrow issued from the transmission logic 
TFRE2j. The management processor PGj then carries out its 
writing request which becomes effective at the time of 
activation of the writing signal w. This has the effect of 
transferring the contents of the register RDM2j into the 
central memory RAM and to reinitiate for a next transfer the 
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logic TFRR2j. 

This system authorizes a simultaneous transfer of blocks 
in two directions and permits treating more rapidly the 
defects of blocks bi in the cache memory MCj when the latter 
is saturated. It authorizes also the operation of a conven- 
tional mechanisiB for anticipation of the reading of blocks. 

In another embodiment shown in FIGURE 5, the connection 
LSj comprises a single bidirectional link provided at each 
extremity with a validation logic LVl and LV2 constituted by 
a logic gate with two open connector inputs OCl and 0G2, one 
of the inputs being connected to the series output of the 
memory register RDM, for the gate OCl and the processor 
register RDPj for the gate OC2, the btheir input being con- 
nected to the output Q of a control flip iElop BCl and BC2; 
each of these is connected by its inputs S and R to the 
transfer logic TFR for the flip flop BDl and TFR« for the flip 
flop BD2* 

Readings and writings are carried out in an exclusive 
manner, at the sole initiative of the management processor PGy. 

A reading memory activates the reading signal r which 
causes the setting to 1 of the flip flop BCl by its input S, 
the resetting to zero being controlled, on the input R by the 
transfer logic TFR at the end of the transfer of the block. 

A writing memory releases a mechanism identical to the 
validation logic LV2. 

Other combinations of registers/ links are possible, and 
in the case of a iDidirectional link, bidirectional shift 
registers may in particular be used, receiving a signal of the 
direction of transfer- This solution leads to the use of 
shift registers more cpmplex in logic, therefor a priori less 
efficient in speed of transfer. 

The speed of transfer coming to be very high, the shift 
registers RDl^ and RDPj, their associated control logic TFR and 
TFR', the validation logics LVl and LV2, are selected from a 
rapid technology (ECL, ASGA) , and synchronized by a clock of 
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a frequency F at least equal to 100 MHz. 

Another solution with multiplexed registers shown in 
FI6USE 21 permits, as will be understood below, considerably 
reducing the quantity of efficient, and therefore costly logic 
necessary. 

The multiprociessor of FIGURE 1 was provided at the same 
time with a common communications bus for block addresses and 
data transfer series links. FIGURE 6 shows, as a variation, 
a multiprocessor system of the same general principle, but in 
which data and addresses transfer by the series connections, 
in the absence of a common bus. 

This system comprises, besides the memory registers RDM,, 
complementairy shift registers RDC, able to store the addresses 
of blocks called for and controlled by a logic of the type 
TFR,. Further^ an access management arbitrator ABM is con- 
nected to the central memory RAM and to the complementary 
registers RDCj by their parallel output. Each logic TFR, is ' 
connected to this arbitrator ABM of conventional stiructure. 
The management processor PG, of each cache memory MC, is 
connected to one part of the parallel input of the processor 
re9ister RDPj, in order to have access thereto in writing. 

For reading a block bj in central memory RAM, the manage- 
ment processor PGj places the address of the block called for 
and the nature of the request (by a prefix bit: 1-read, 
0=write) in the part of the register processor RDPj to which 
it is accessible, which has the effect of initializing the 
transfer of this information. The transfer logic TFR, detects 
the end of the transfer on the complementary register RDCj and 
activates a demand operation toward the arbitrator ABM. This 
is loaded for serializing and treating the read requests of 
the block in the central memory RAM while going to read the 
address of the block requested in the complementary register 
RDCj corresponding to the transf r logic selected by the 
arbitrator ABM, then in going to read the block in the central 
memory RAM which will then be loaded into the memory register 
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RDMj and transmitted as before. 

For writing a block in the central memory RAM, the 
management processor PGj chains the transmission of the address 
then of the block to be written through the processor register 
RDPj. The, complementary register RDCj receives at once the 
address and the nature of the request. 

The transfer logic TFRj analyzes this rec[uest and vali- 
dates the reception of the block in the memory register RDl^ 
from the fact of the nature of the request (write) . The 
transfer logic TFRj is notified of the end of the transfer of 
the block b| and then transmits its request for service to the 
arbitrator ABM. This request is treated, in turn, by the 
arbitrator which activates the writing of the block bi into 
memory. 

Further, the multiprocessor system shown in FIGURE 7 
comprises means for management of the shared data, permitting 
treating, in a static manner, the classical problem of 
maintaining the coherence of the shared data. This system 
comprises the resources of the system of FIGURE 1 (with the 
same reference characters) with the following logic and 
supplementaxy resources: 

A special parallel commimication bus for words BUSD 
connects the processors CFUj and the central memory RAM. A 
partition logic LPj is associated with each processor CPUj. 
Each logic LPj is constituted in a conventional manner by an 
assembly of coupled register-comparators connected in parallel 
on the address bus adr of the processor CPUj, in order to 
provide a division of the memory space of the central memory 
RAM in the zone of non-shared data and shared data, said logic 
LP, delivering to this effect. a signal p (indicating thie nature 
of the data, shared or not). A decoding logic DEC is assoc- 
iated with the central memory RAM, itself arranged to be 
controlled by a writing by word or by block by said logic DEC. 

The decoding logic DEC is detailed* in FIGURE 8 and com- 
prises a decod r DECL, receiving on its data input the address 
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portion word adrm of the address adr, and connected through 
its validation input to the output of a logic gate ET3, each 
output i of said decoder being connected to a validation 
buffer of the type BFS,. The logic gate ET3 receives on its 
inputs the signal p and the inverted signal r. A decoder DECE 
is connected through its validation input to the output of a 
logic gate ET4, its outputs being connected to an assembly of 
logic gates 001, of a number equal to the number of words in 
a block* The logic gate ET4 receives on its inputs the signal 
p and the inverted signal w. The output of the gate ETA is 
also connected to an assembly of . validation input buffers 
BPEi, BPEi...* The central memory RAM may be controlled in 
writing by word. Each clipped word thus defined has its 
writing control input Wj. The output of each logic gate 0U1| 
is connected to the input wl of each clipped word of the 
central memory RAM. 

FIGURE 8 shows further the detail of the addressing of 
the memory tegisters RDM|, which comprises in the first place 
a decoder DECEB connected through its data input to the common 
bus BUSA, in order to receive the nuxaber j of the processor 
concerned by the request of the central unit UCj. This decoder 
DECEB is connected through its validation xnput to the output 
of a logic gate ET5 and through its outputs 2... j to the 
validation buffers BVl, BV,.... The logic gate ET5 receives 
on its inputs the signal p and the inverted signal w. In the 
same manner, a decoder DECIiB is connected through its data 
input to the field j of the common bus BUSA and through its 
outputs 1, 2..-J of this decoder DECLB are connected to the 
loading inputs Idl^ Idj of the memory shift registers RDM,. 
The logic gate ET6 receives on Its inputs the signal p and the 
inverted signal r. 

The operation of the system is as follows: at each 
reference memory, the processor CPUj provides an address on 
its address bus adr, and the nature of the request : reading 
r or writing w. It waits for a datum in case of reading and 
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furnishes a datum in case of writing. The address adr 
traverses the partition logic LPj, which indicates, by the 
signal p, if the address adr belongs to a zone, of non-shared 
data (p=0) or of shared data (p==l) . In the first case^ the 
request is directed to . the management processor PGj and is 
treated according to the mode of operation described in 
reference to FIGURE !• In the second case, the request is 
directly sent through the common bus BUS A. The bus address 
adr permits supplementary address streams permitting iden- 
tification of the word concerned: the address adr is com- 
prised of a block address part adrb . and a word address part 
adrm. Thus, after agreement of the bus arbitrator AB, the 
central memory RAM receives either a bldck transaction request 
(p«0) and in this case^ only the block pairt adrb of the 
address adr is significant, or a word transaction request 
(p»l) and, in this case, the whole address adr (block adrb and 
word adrm) is significant. 

In case of a reading block, p»0 and r»0, the logic gate 
ET^ validates the decoder DECLB which delivers a loading signal 
LDj to the shift register RDM,, permitting loading into the 
latter the block read into the central memory RAM with the 
address adrb by the reading signal r. 

In the case of a writing block, p=0 and w=0, the logic 
gate ET5 validates the decoder DECEB which delivers a valida- 
tion signal to the buffer BV,, permitting the contents of this 
register to be presented to the central memory RAM and to thus 
be written to the address adrb, the output of the logic gate 
ET5 providing the block writing signal. This latter is 
broadcasted on the writing inputs wl, Wj, . . . to the clipped 
words of the central memory RAM ^across the logic gates OUl,. - 
In case of word reading, p=l and r=0, the logic gate ET3 
validates the decoder DFCL which delivers a validation signal 
to the buffer BFSj, permitting the requested word (address 
adrm in the block adrb) of which the reading is assured by the 
signal r, to be directed toward the special communication bus 
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BUSD. This word is recovered directly by the processor CPUj 
on its data input data. 

In case of writting word, p=l and w^O, the logic gate ET4 
validates the decoder DECE which furnishes on its output ^ a 
signal directed across the Ijogic gate 0U1| toward the writing 
input Wj of the clipped word of the central memory RAM con- 
cerned. This isignal present at the input Wj permits writing 
in this sole clipped word the word furnished by the processor 
CPUj on the data bus BUSD. The contents of this bus are 
presented in parallel on all the clipped words of the central 
memory RAM, due to an activation of the buffers BFE, by the 
signal emitted by the logic gate ET4. 

An essential characteristic of the architecture of the 
invention is to present a minimiam load of requests on the 
common bus BUS A, In the architecture shown schematically in 
FIGURE 7 , the common bus BUSA is driven by the block addresses 
and the word addresses. The frequency of the word address 
requests is a function of the rate of shared data and may lead 
to a saturation ' of the common bus BUSA. 

FIGURE 9 shows a variation of a solution for reducing 
this load. The system provided comprises, in addition to the 
resources of FIGURE 7, a bus BUSAM for the word addresses, an 
arbitrator AB« for arbitrating the access conflicts to the bus 
BUSAM, an arbitrator ABM for arbitrating the access conflicts 
in the origin of the busses BUSA and BUSAM, and connected to 
a multiplexer MUX which in turn is connected through its 
inputs to the two busses BUSA and BUSAM. 

The operation of this system is as follows: 

If the request concerns non-shared data (p=0) , any lack 
of information generates a memory request of the block type 
which transfers through the common bus BUSA. 

If the request relates to shared data (p«l) , the request 
is directed toward the common bus BUSAM. Thus, the central 
memory RAM may receive simultaneous requests on the two busses 
BUSA and BUSAM, which must then be arbitrated. The arbitrator 
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allows, in a conventional manner, access to the central 
memory RAM by one of the two requests and reconstructs the 
signal p from, the origin of the request (p-O for BUSA., p»l for 
BUSAM) . The signal p thus controls both the multiplexer MUX 
.which allows the signals from the bus concerned to pass 
through the request,, and the decoding logic DEC: one is thus 
situated as in the preceding system. 

It will be noted that the load is shifted from the common 
bus to the central memory RAM, since the rate of recpiest at 
the level of this latter remains the same, and its cycle time 
is of the same order of magnitude or even greater than that of 
the bus cycle. 

This solution is therefore only of interest if the 
central memory RAM comprises central independent memory banks 
organized according to the description given later with 
reference to FIGURE 20: several transactions may in this 
case, if they affect different memory banks, take place 
simultaneously. 

FIGURE 10 presents a block diagram of an embodiment of • 
the architecture according to the invention, in which the 
problem of shared data is treated in a dynamic fashion. To 
this end, the system conforming to this embodiment comprises 
an snooper processor with a bus PEj, coupled to a management 
processor with a parallel link I>GPj. A management processor 
with a series link PGSj is connected to a snooper processor 
PEj by a waiting line FIFOj. A management processor for 
requests from the central unit PGUj is connected, first to the 
treatment processor CPUj, and to the parallel link management 
processors PGPj and the series link manager PGSj. .The logic 
- corresponding to the management processor PGj of each cache 
memory is in this embodiment split into various processors 
presented hereabove. The access to the cache memory MCj and 
to its directory RGj is controlled by a management processor 
with a directory and a cache PGR,, 

Finally, a management processor PO! of the central memory * 
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RAM is connected to the bus BUSA and to the central memory 
RAM, and to its shift registers RDM,. 

The functioning of the assembly is as follows: 
Each transaction on the common bus BUSA corresponds to a 
reading or writing request of the block b,. The snooper 
processors of the bus PEj are activated by each reading request 
of the block of data. This operation provided in the same 
cycle through all of the snooper processors is going to permit 
protecting the uniqueness of the value of the shared data. 
The snooper processor PE^ arranges an access to the directory 
RGj. The function of the application used for the management 
of the cache memory MC, is in the described embodiment of the 
direct application type. Each element of the directory is a 
descriptor of th$ block which contains a "tag" field (address 
of the block) , conventional status bits of the block: a 
validation bit v and a modification bit m and two supple- 
mentary bits a for noting that the block is known by the cache 
memory but still in the course of transfer on the series link, 
f for indicating that the block is in the waiting line FIFO 
and thus avoiding its being placed several times. 

The management processor memory PGM arranges a waiting 
line AFIFO of addresses of blocks bi and of addresses of 
processors, accessible in ah associative manner, and a. status 
directory of blocks constituted by 2 bits per block ro arid rw 
indicating the possible states of the following block: 
ro = rw = 0: block not yet broadcast, 
ro = 1; rw = 0: block already broadcast in reading: 
one or several copies of this block being found in the cache 
memories . 

- The evolution of block status bits is the following, 
different according to the nature of the request of the treat- 
ment processor CPU,: 

If the processor CPU, makes a request for reading 
non-shared data (program space or explicitly non-shardd data) : 
the block is marked already broadcast in reading (ro « i; rw 
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=« 0) central memory part at the time of transfer of said block 
from the central memory RAM to the register RDI^ and marked 
non-modified (m = 0) , cache memory part in the same cycle of 
operation of the directory RGj of the cache memory (valid 
block) . The snoopers have not reacted to the request on the 
common bus (the request having been made with the indication 
"shared" or "non-shared") . 

If the processor CPUj makes a request for reading 
data (a priori shared) , the common bus BUSA is occupied for 
the time of passage of the information of addresses and of the 
type requested, the time of their treatment by the processor 
PGM and the snoopers of the common bus PEj. In the central 
memory RAM, this block may be: 

1. Not yet broadcast: ro » rw = o. It is then 
transmitted to the central unit UC^ and takes the non-modified 
state. 

2. Already broadcast in reading: ro «> l; rw » 0. 
It is then transmitted to the centnral unit UC,. Its state 
does not change. 

3. Already broadcast in writing: ro = 0; rw » 1. 
The open copy of this block is found in a cache memory MCi. 
The snooper processor associated with this cache memory has 
noted the request of the central unit UC, at the time of 
passage of the address on the common bus and has attempted its 
transfer to the central memory RAM as soon as possible on the 
series link LS,. While waiting its effective transfer, the 
memory management processor PGM places the request while 
waiting in the associative waiting line which comprises a 
number of elements equal to the number of processors. 

At the -time ot the reading request on the • common bus 
BUSA, all the snooper processors PE| have reacted by consult- 
ing the directory RG, associated with their cache memory Mq. 
The common bus BUSA is only freed when all the snooper 
processors PEj have had their access to the management direc- 
tory RG,, which assures the same state of the block in' the 
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entire system. The processor which has the current copy in 
its cache memory carries out, as soon as its series link is 
free, the transfer of this block into the register RDM| and 
makes a writing request for the block on the common bus which 
will have the effect to free the request while waiting in the 
associative file AFIFO and. carrying out the operation of the 
status bits of the block. 

. The operation of the block only needs a writing to the 
central memory RAM without activation of the snooper. 

If the processor CPUj requests the writing of a datum in 
a block present in its cache memory MCj with a non-modified 
state, an informative writing request must be emitted on the 
common bus BUSA as it is possible that other cache memories 
Mq have this block with the same status. These other memories 
must be informed of the change of state. To this end, all the 
snooper processors PE| (activated by the informative writing 
broadcast on the common bus BUSA) consult their management 
directory and invalidate this block, while the central memory 
notes in the same time the change of state of this block as 
well as the parallel management processor PGPj, in the manage- 
ment directory RGj. The liberation of the common bus BUSA by 
all the snooper processors and the central memory RAM permits 
the processor CPUj to provide the writing in its cache memory 
MCj, the broadcasting of the status bit of the management 
directory RGj having been carried out. 

If one central unit is awaiting access to the bus BUSA 
for the same request on the same block, its request is 
transformed in simple writing and then follows the protocol of 
the request of the block in writing. 

If the processor CPUj requests writing ^of a datum in a 
block absent from the cache memory MC,, this block is read 
into the central memory RAM and brought into the cache memory 
MCj in order that the writing will be carried out effectively. 
In the central memory' RAM, this block may be: 

1. Not yet broadcast: ro rw = 0. The block is 
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then emitted on the series connection LSj to the cache memory 
MCj. It takes the states ro = O; rw = 1 in the central memory 
and the modified state (m=l) in the cache memory, 

2. Already broadcast in writing: ro = i; rw = 0. 
The block, is sent on the series connection LSj to the cache 
memory MCj. It takes the states ro » 0, rw = 1 in the central 
memory, and the modified state (m «» l) in the cache memory. 
At the time of a recpiest on the common bus BUSA, the {espion) 
processors PE, have noted the request and invalidated this 
number in the block in their cache memory Mq. 

3. Already broadcast in writing: ro = Q; rw = i. 
The request is placed in the associative waiting line APIFO 
and the common bus BUSA is liberated. 

The snooper processor PEj of the cache memory. MCj, winner 
of the current copy/ activates as soon as possible the 
transfer of the block requested from its cache memory MC, to 
the central memory RAM. This block is then invalidated in the 
cache memory MC,. 

The Central unit UCj makes a writing request to publish 
the block in the two following cases: 

a) the cache memory is saturated and the purging of a 
block requires sending this block into the central memory, 

. b) a central unit UC, is awaiting a block the only 
current copy of which is found in the cache memory MC,-. The 
snooper processor notes the request and provides as soon as 
possible the purging of this block. 

The central memory side RAM, each request for a writing 
operation entails a consultation with the associative waiting 
line APIFO, and in the case of discovery of a central unit uq 
awaiting this block, the- loading of this block into the shift 
register RDM, and the publishing of the status bits cor- 
responding to this block. This type of writing request does 
not drive the {espion} processors. 

. The directory management processor PGR, which is added in 
this embodiment, permits the execution of the algorithm men- 
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tioned above, while coordinating the accesses to the directory 
of the cache memory MCj which receives the requests of three 
asynchronous functional units: 

1. The treatment processor CPOj, in order to read 
the instructions of the program during execution and to read 
or write the data manipulated by this program, 

2. The common bus BUSA, in order to maintain the 
coherence of the data in the cache memoiry HCj, 

3. The series link LS„ in order to load/un-load a 
block of information from/ to the central memory H2^. 

Each of these requests arrives at the management direc- 
tory RGj of the cache memory. The serialization of these 
accesses on said management directory permits assuring the 
good operation of the algorithm mentioned above with coherence 
of information in the cache memories. Thus there is. obtained 
a strong coupling of the requests at the level of the manage- 
ment directory RGj but the synchronization which must exist at 
the level of the treatment of these requests is sufficiently 
low to consider an asynchronous functioning of the treatment 
logic for these requests, which leads to the following 
functional segmentation: 

The interface of each processor CPUj and of its auxil- 
iaries (PGSj, PGUj) with the common bus BUSA is composed of two 
parts having a mutually exclusive functioning: the management 
processor for the parallel link PGP, is loaded to request of 
the common bus BUSA to the request of the request management 
processor PGUj or of the management series link PGSj and to 
control the common bus BUSA in writing. The snooper pro- 
cessor - of the bus PEj assures the snooping function, which 
returTis to control the common bus BUSA in reading. • It 
frequently reaches the directory RGj of the cache memory MCj. 

The management processor of the series link PGSj manages 
the interface with the series link LSj. It assures the loading 
and unloading of . blocks of information bi upon request of the 
request management processor PGUj and of the snooper processor 
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of bus PEj. It reaches less frequently the cache memory MCj 
and the corresponding management directory RGj. 

The request management processor PGU, assures the follow- 
ing of the requests issued from the processor CFUj. It reaches 
very frequently, into the cache memory MCj and the management 
directory RGj« This, interface includes possible "MMU" logic 
(memory management unit) usually associated with the treatment 
processor CPOj. 

The management processor PGI^ of the management directory 
RGj is the arbitrator charged with granting access to the cache 
memory MCj. 

FIGURES 11, 12, 13, 14, 15, 16 and 17 show by way of 
exeuaples the eiobodiments of the various functional units of 
the system of FIGURE 10. The designations of signals or 
inputs and outputs of these units are chosen in a conventional 
manner. The signals of the same functionality which are 
generated in each functional unit from a base signal will be 
designated by the same reference, for example: dnp - non- 
shaxed data, dl = reading request, maj « updating, de « demand 
writing, = informative writing. The system has several 
processors and the indicia used heretofore refers to a 
current processor and its auxiliaries; for alleviating the 
description, this indicia has been omitted in these figures 
and it will be understood that the description which follows 
is directed at each of the functional units which are con- 
nected with each treatment processor. Further, the signals 
denoted x_YZ define the name and the origin of the signal in 
the case where YZ =* RG, MC, UC, and the source and the 
destination of the signal in the other cases, with Y and Z 
representing: U « PGU, R « PGR, P « PGP or PE, S - PGS. 

The cache memory MC shown in FIGURE 11 has, for example, 
a capacity of 16 K . It is organized in 16 modules of high 
speed read write memory of 1 k MCO, ••.MC15, each accessible 
on a front of 4 octets: the address bus of the cache memory 
MC (noted adr_MC) comprises a partial block address adr_block 

35 • 



and a partial word address in the block adr_mot. The address 
bus adr^MC cc»nprises 14 lines, p ntiitting addressing the 16k 
of the cache memory MC. The partial adr_bloc comprises 8 
lines permitting addressing the 256 block sites in the cache 
memory, and the adrjnot part 6 lines permitting addressing a 
word in the block the size of which is, in the example, 64 

octets. . ' . 

The address part adr_bloc is connected to the adress 
input of each of the memory modules MCq.-.MCj^j. The word 
address part adrjnot is connected to the input of two decod- 
ers DECO and DECl (only the 4 significant bits of the address 
bus adrjnot are used : the address is an octet . address and 
the cache has an access unit which is a word of 4 octets) . 
The read signal rJiC is delivered to each of the read inputs 
of the memory modules MCQ-.-MCj^g and to one of the inputs 
of a logic gate OUl. The other input of this logic gate OUl 
receives the inverted signal bloc. The writing signal wJlC 
is delivered to one of the two inputs of the logic gates 002 
and 0U3. The logic gate 0U2 receives on its other input the 
inverted signal bloc. The logic, gate 003 receives on its 
other input the signal "EIoc. The output of the logic gate 
001 is connected to the validation input enl of the decoder 
DECl, and the output of digit position i of this decoder DECl 
activates a validation buffer BVL of digit position i. The 
output of the logic gate 002 is connected to the validation 
input enO of the decoder DECO and to the validation buffers 
BVE. The output of -the logic gate 0U3 is connected to the 
logic gsites ETIq. . .ETlj^sr which receive on tiieir other 
input the output of the corresponding logic , position of the 
decoder DECO. The output i of each logic gate ETIq... 
ETlj^g is connected to the ' writing inpout Wq...Wjl5 of 
each memory module MCq...MCj^5. A data bus connects each 
memory module MCQ.-.MCj^g to one of the validation . buff ers 
BVL and to one of the validation buffers BVE. The" output of 
the buffers RVL and the input of the buffers RVE receive in 
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parallel a data bus datamot_MC (connected to the request 
management processor PGO) • 

The operation of the example of the cache memory 
described above is as follows: 
Case 1 

The request comes from the management processor of the 
series link PCS. This cass'^is signalled by the presence of a 
lo^ic state zero on the signal bloc. 

In reading the cache memory, the management processor 
of the series link PGS presents on the address bus adr^MC the 
address of placement of. the block to be read (in this case, 
only the part adr_bloc of the bus adr_MC is used) and activ- 
ates the reading signal^ fJAC. At the end of the access time, 
the block is available on the bus databloc_MC. 

In writing the cache memory, the management processor 
of the series link presents on the address bus adr_MC the 
address of placement of the block to write, on the data bus 
databloc_MC the data to be written there, and activates the 
line wjic. the zero state of the signal bloc directs the 
signal w_MC toward the writing control Inputs of the cache 
memory modules HCq...MC<|^2, via the logic gates (XJ3 and 
ETl. The information presented on the data bus datablocJHC 
is written in the cache memory at the end of the writing; 
Case 2 

The request come from the' request management processor 
P6U of the treatment processor CPU. This case is signaled by 
the presence of a logic state one on the signal bloc. 

In the reading cache memory, the management - presents 
on the bus adrJMC the address of the word requested, and acti- 
vates the reading signal r^MC. The block corresponding to 
the part adr_bloc is read in cache memory r and the word 
requested is directed, via one of the validation buffers BVL, 
toward the data bus datamot_MC. The validation buffer BVL 
concerned is activated by the output of the dedoder DECl 
corresponding to the word address adr_mot requested. 
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In writing the cache memory, the management processor 
PGU presents on the bus adr_MC the address of the word to be 
written, on the data bus datamot_MC the data to be written, 
and activates the writing signal wJ4C. The data presented on 
the bus datamotJMC is distributed on each cache memory mod- 
ule, via the buffers BVE validated by the writing signal. 
The writing signal wJlC is 'then presented to only the memory 
module concerned. It is delivered to the output of the decod- 
er DECO corresponding to the address adr_mot concerned. 

In the embodiment described above, the problems of 
access in an octet and double octet, and of access in double 
octet and a word on both sides of two memory modules are 
resolved in the same manner as in conventional information 
systems emd are not described here. 

FIGURES 12a, 12b, 12c and 12d show, by way of example, 
the dharacteristics of a management directory of the cache RD 
• and of an associated management processor PGR. FIGURE 12a 
illustrates the logic structure of the address adr_RG in the 
hypothesis of a space addressing on 32 bits and with the char- 
acteristics of the cache memory described previously. The 
field -tag-, composing the address block, is coded on 18 
bits. The field -cadre- is coded on 8 bits and permits 
addressing the 256 positions of the bloclc of the cache memory 
MC. The last 6 bits define the address word in the block, in 

an octet unit. 

FIGURE 12b shows the structure of the management 
directory of the cache RG, which is a simple fast read/write 
memory of 256 words of 22 bits. Each word of the address i 
contains the descriptor of the block written, in the position 
i of the cache memory . 

FIGURE 12c shows schematically the structure of the 

descriptor which comprises: 

a field tag of 18 bits, defining the address of the 
block in the position or frame of the current block, 

the validation bit v. 
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the modification bit 

the wait-for-end-of -transfer bit a^ 

the wait-for-purge bit f • 

FIGURE 12d provides the structure of the processor 
PGR, which is nothing other than a conventional arbitrator 
with fixed priority. 

This arbitrator 'comprises a register LATCH, of which 
three inputs .receive respectively . the signals rqit^DR, 
rqst_PR, rqst_SRr also delivered respectively to the logic 
gates ET2, ET3r ET4. The corresponding outputs of the regis- 
ter LATCH are connected to the inputs of a priority encoder 
PRI, the outputs of which are connected to the inputs of a 
decoder DECPRI* The outputs of the digit position correspond- 
ing to those of the register LATCH are connected to the sig- 
nals grnt_DR, grnt_PR, grnt_SR as well as, in an inverted 
manner, respectively to the inputs of the logic gates ET2, 
ET3, ET4, The outputs of the logic gates ET2, ET3, ET4 are 
connected to the inputs of the logic gate NOUl. The output 
of the. logic gate NOOl is connected to a flip flop Bl, which 
receives on its input D the output eO of the priority encoder 
PRI. The assembly of the apparatus is synchronized by a gen- 
eral clock which delivers a signal h to one of the inputs elk 
of a logic gate ET5, and in a reverse manner, to the clock 
input of the flip flop Bl. . The output Q of the flip flop Bl 
is connected to the other input of the logic gate ET5. The 
output of the logic gate EIS is connected to the input load 
of the register LATCH, 

The operation of this ^arbitrator is a$ follows: - in 
the absence of any request on the lines rqst, the flip flop 
Bl stores permanently the state of the line eO, inactivates, 
and thus validates through the logic gate ET5 the loading of 
the register LATCH. 

The arrival of a signal rqst provokes the latching of 
the clock and the activation of the signal grnt associated 
with the signal rqst, until deactivation of- the latter: the 
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arbitrator is fixed in its state during the entire transac- 
tion in progress. 

The request management processor PGU, represented in 
FIGURE 13 r constitutes an interface between the treatment 
processor CPU and: 

the various processors with which it must exchange 
^ information: paralllel management processor PGPr series 
management processor VGS, directory management processor PGRr 
and 

the mcinagement directory of the cache memory RG and 

the cache memory MC. 

The treatment processor CPU unlatches the activity of. 
the request management processor PGU while activating the sig- 
nal as ("address strobe**) • This signal validates the address 
bus adr_CPUr the reading signals r_CPU and the writing sig- 
nals w_CPU as well as the function lines f c_CPO of the treat- 
ment processor CPU. The treatment processor CPU is then 
placed in a waiting state until discharge of the request by 
the signal dtack^CPO . 

The signal as is connected to the input of a different- 
iator circuit DIO. The output of this circuit is connected 
to one of the three inputs of a logic gate ET12, the two 
other inputs receive respectively the signals ack_US and 
ack_UP. This latter signal is also delivered to the input R 
of a flip flop B13. The input F of the flip flop B13 
receives the output of a logic gate NETIO. The output of the 
logic gate ET12 is' connected to the input S of a flip flop 
Bllr at the input S of the flip flop BIO and at the input 
clearlO of a shift register SRIO . The output - Q of the flip 
flop Bll furnishes the signal rqst_UR. The flip flop fill 
receives on its input R the inverted phase 813 and the flip 
flop BIO the inverted phase 911. The output Q of the flip 
flop BIO is connected to the series input serial_inlO of the 
register SRIO. The shift register SRlO receives on its input 
clJclO the clock signal *h- and on its validation input enlO 
the signal grnt_UR. 
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The activation of the signal as releases the operation 
of the differentiation circuit DlO. The pulse produced by 
this circuit across the logic gate ET12, sets to the logic 
state one of the flip flops BlO and Bll by their input Sr and 
also carries out the resetting to zero of the shift register 
SRIO by its input clearlO. 

The flip flop BlO and' the shift register SRlO consti- 
tute the logical sub-assembly "phase distributor" DP_U. The 
activation of this phase distributor is released by the sett- 
ing to one of the flip flop BlO and the resetting to zero of 
the shift register SRlO . If the shift register is validated 
by the presence of a level zero on its input enlO , then the 
next pulse of the clock h on the input elk of the shift regis- 
ter produces the shifting of one step of said register. 

The logic state one of the flip flop BlO is propagated 
on the output ©11 of the shift register SRlO by its series 
input serialinlO, The output Qll, called phase Oil, invert- 
ed, resets the flip flop BIO to zero through its input R. 
Thus, one single bit is introduced to the shift register SRlO 
on each activation of the phase distributor DPJD. At each 
clock pulse h, this bit comes to be shifted in the shift 
register SRlO and produce the consecutive disjointed phases 
911, 812 # 913. 

The setting of the flip flop Bll to the logic state of 
one causes activation • of the signal r^tJJR. This signal is 
emitted to the destination of the directory management proces- 
sor PGR. This latter, as soon as possible, comes to accord 
access to the management directory RG and to the cache memory 
MC while activating the signal grntJUR, which, comes to valid- 
ate the assembly of passage buffers BVIO, BVll and BV12 situa- 
ted respectively on the busses of the management directory 
and on the busses of the cache memory. This signal grntJDR 
also validates the phase distributor which then produces 
sequentially the phases 811, 812, 913. 
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The phase 911 corresponds to a time-out permitting 
reading of the descriptor of the block requested by the treat- 
ment processor CPU in the management directory RG, addressed 
by the field frame with the address adr_CPU and connects to 
the bus adr_RG through the passage buffers BVIO . The signal 
f_RG is always active on the input of a buffer BVlO, the sig- 
nal w_RG always inactive on the input of a buffer BVlO . At 
the start of the time oiit, the descriptor is returned to the 
processor PGU via the bus date_RG, The part tag of this 
descriptor and the validation bit v are delivered to one of 
the comparison inputs of a comparator COMPIO ^ the other input 
being connected to the part tag of the address adr_CPU . The 
bit opposite the validation bit v is always at onev The com- 
parator COMPlO is validated permanently by the presence of a 
level of one on its input enll* 

The time of access of the management directory RG and 
the frequency of the clock h are in a ratio such that at the 
end of the phase 611, the output eg 10 of the comparator 
qOMPlO is positibnned and furnished the information "the 
block requested is present in the cache memory or absent from 
the cache memory." 

If the block requested is present in the cache memory 
MC (eglO » 1), then the signal eglO, delivered to one of th 
two inputs of the logic gate ETIO, furnishes a signal calib- 
rated by the phase 012, present on the outher input of the 
logic gate ETIO • 

This calibrated signal present on the output of the 
logic gate ETIO is connected to the inputs o£ the logic gates 
NETIO, NETll, NET12, 

The logic gate NETIO receives on its inputs, in 
addition to the output of- the logic gate ETlQ, the inverted 
srtatus bit m coming from the descriptor, and the inverted 
writing request signal w_CPO* 

The activation of the logic gate NETIO corresponds to 
the state "request writing of a word in a block present in 
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the cache and which has not yet been modified (m = 0)". The 
output of the logic gate NETlO is connected to the input S of 
a flip flop Bl3. The activation of the logic gate NETlO sets 
the flip flop B13 in the logic state of one, which releases 
an informative writing request by the line rqst_UP to the 
management processor of the parallel link PGP* The address 
of the block concerned is furnished by the lines adr_bloc_UPr 
derived from the lines adr^CPU. 

The request management processor PGO has terminated 
the first part of its task: the management directory RG and 
the cache memory are freed by deactivation of the signal 
rqst_URr as a consequence of the arrival of the inverted 
phase 913 on the input R of the flip flop Bll. 

The informative writing mechanism is described in the 
paragraph "management ptocessor of the parallel link PGP"f 
and has for effect to place the requested block in the modi- 
fied state (m - 1) or the invalid state (v ». 0). One will 
note that the liberation of the management directory RG and 
the cache memory MC by .the request mauiagement processor PGU 
is necessary in order that the management processor of the 
parallel link PGP may" have access thereto. The end of the 
"informative writing" operation is signalled to the request 
management processor PGU by the activation of the signal 
aclc^UP, which has the effect of resetting the flip flop Bl3 
to zero and activating, through the logic gate ET12, the flip 
flop B12r the flip flop Bll and the phase distributor: the 
cycle initiall activated by the signal as is reproduced, but 
the consequential sequence of the activation of the gate 
NETlO is not reproduced a second time in this request cycle. 

The logic gate NETll receives on its inputs, other 
than the output of the logic gate ETIO, the status bit m from 
- the descriptor, and the inverted signal of the . writing 
request w_CPO\. The activation of the gate NETlI corresponds 
to the state "writing request in a block present in the cache 
and which has already been modified. The output of the gate 
NETll is connected, through one of- the buff ers BVll,, to the 
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writing signal w_MC of the cache memory MC. This signal 
permits writing in the cache memory, at the address present 
on the bus adr_MC, connected to the bus adr_CPU, via the 
buffers BVll, the data present on ths bus data_MC, connected 
to the bus data_CPU via the bidirectional buffers BV12 . The 
direction of activation of these buffers is furnished by the 
signal w_MC, 

The output of the gate NETll is also connected to one 
of the inputs of the logic gate ETll, which thus re-^sends the 
signal dtack_CPU to the treatment processor CPU. The writing 
operation in the cache is done in parallel with the activa- 
tion of the signal dtacJc^CPO , which is conformed to the usual 
specifications of treatment processors. 

The operation is terminated by freeing of the manage- 
ment directory R6 and of the cache memory MC by deactivation 
of the signal rqst^UR, as a consequence of the arrival of the 
inverted phase W3 on the flip flop Bll. 

The logic gate NET12 receives on its inputs, in addi- 
tion to the output of the . logic gate ETIO, the inverted read- 
ing request signal f_CPO. The activation of the logit gate 
NET12 corresponds to the state "request reading of a word in 
a block present in the cache." 

The continuation of the operations is identical to the 
preceding operation, with the sole difference of the activat- 
ed signal (f_MC rather than w_MC) associated with the direc- 
tion of transit of the data on the busses data^CPO and 
data_MC . 

If the block requested is absent from the cache .memore 
(eglO = 0), then the signal eglO, inverted, connected to one 
of the two inputs of the logic gate NET13, furnishes a signal 
calibrated by the phase 012, present on the other, input of 
the logic gate NET13, The output of the logic gate NETll is 
connected to the input S of the flip f lop B12 . This calibrat- 
ed signal forces the flip flop to one, which has .the effect 
of sending a service request rqst_OS to the management proces- 
sor of the series link PGS. This processor also* receives the 
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address of the block to request on the lines adr_bloc_OS and 
the nature of the request on the lines w_OS, r_OS and fc_US, 

The request management processor PGU has terminated 
the first part of its task: the management directory RG and 
the cache memory MC are freed by deactivation of the line 
rqst^UR/ as a consequence of the arrival of the inverted 
phase 613 on the flip flop Bll. 

The mechanism for updating the cache is described in 
the paragraph "management processor for the series link PGS". 

One will note that the liberation of the management 
directory RG and the cache memory MC is necessary in order 
that the management processor for the series link PGS may 
have access thereto. 

The updating of the cache is signalled to the request 
processor PGO by the activation of the signal aclc^US. This 
signal is delivered to the input R of the flip flop B12 and 
to the input of the gate ET12. It thus has the effect of re- 
setting to zero, the flip flop Bl2 and activating through the 
logic gate ET12, the flip flop Bll and the phase distributor: 
the cycle initially started by the signal as is reproduced, 
but this time with success due to the presence of the block 
in the cache memory. 

The series management, processor PGS shown by way of 
example in FIGURE 14 is charged with managing the series link 
LS and in this way to provide the block transfer requests 
between the central memory RAM and the cache manory MC and to 
produce the corresponding operations in the management 
directory RG. It treats in priority the requests from the , 
snooper processor PE and waits in a waiting line FIFO. It 
also treats the requests from the request processor PGU. 

This management processor for the- series link PGS 
comprises a flip flop B20 which receives on its data inputs D 
the signal empty coming from the waiting file FIFO and on its 
clock inputs the output Q of a flip flop B22. A. flip flop 
B21 receives on its input the output data of a logic gate 
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(XJ20. This logic gate OU20 validates the signal rqst_US, 
received on one of its two inputs, the other input being 
connected to the signal empty; The clock input of the flip 
flop B21 coming from the output Q of the flip flop B22. The 
output Q of the flip flop B22 is reconnected on its data 
input D, which conditions it while dividing by two. The 
clock input of the flip flop B22 is connected to the output 
of a logic gate ET20, which receives on one of its inputs the 
general operating clock h and on the other input a validation 
signal. This validation signal comes from the output of a 
logic gate ET24, which receives respectively on- its two 
inputs the outputs ^ and Q of the flip flops B20 and B21. 

The flip flop B20 receives on its input R the phase 
925, inverted^ issuing from a phase distributor DP_S. the 
flip flop B21 receives on its input S the output of a logic 
gate NOU20 . This logic gate NOU20 receives on its two inputs 
the respective outputs of the logic gates ET22 and ET23. The 
logic gate ET22 receives on its inputs the phase 025 from the 
phase distributor and the signal maj from a logic gate ET29. 
The logic gate ET23 receives oh its inputs the phase 927 from 
the phase distributor and the inverted signal maj. 

The assembly of the circuits B20, B21r B22, ET20 , 
ET22, ET23, OU20, ET24, IJOU20 * constitute a fixed priority 
arbitrator ARB_S. Its operation is as follows: the flip 
flop B22 furnishes on its outputs Q and Q the alternate freq- 
uency signals half of that of the general clock. These sig- 
nals validate alternatively the flip flops B20 and B21. If a 
request for service is present on one of the. inputs of the 
flip flops B20 or B21, then these alternated signals store 
the request in the corresponding flip flop (B20 for the snoop- 
er processor PE, B21 for the request management processor 
PGU) which, in return, lock out the alternated function. One 
will note that the signal rqst^US originating from the 
request management processor PGU is conditionned by the sig- 
nal empty through the gate OO20: the signal is only 
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considered, if the waiting line FIFO is empty. The flip flop 
B20 (respectively B21) is only reset to zero when the transac- 
tion to be carried out is terminated. 

The sampling of a request on one or the other of the 
flip flops B20 and B21 translates by a change of state of the 
output of the logic gate ET24. The output of the logic gate 
ET24 is also connected to a differentiator circuit D20 which 
delivers a pulse upon changing of the state of the output of 
the logic gate ET24. The output of the differentiator cir*. 
cuit is connected, on one part» to the phase distributor DP_S 
(input S of flip flop B36 and clr20 of a shift register SR20) 
of the management processor for the series link, and for the 
other part to one of the two inputs of the logic gate 0022. 
The output of the logic gate is connected to the inputs S of 
the two flip flops B24 and B25 . The flip flop B24 receives 
on its input R the output of a logic gate N0U21 and the flip 
flop B25 the signal grnt_SR. The logic gate N0U21 receives 
on its two inputs the outputs of the logic gates ET36 and 
ET37. The logic gate E36 receives on its inputs the phase 
023 from the phase distributor and the signal maj, and the 
logic gate E37 the phase ft27 from the phase distributor and 
the signal maj inverted. 

The pulse from the differentiator circuit D20 thus 
initializes the phase distributor and sets the logic state of 
the flip flops B24 and B25. to one across the logic gate 6u22. 

The phase distributor DP_S is composed of shift regist- 
er SR20 and the flip. flop B36. Its operation is identical to 
that described in the paragraph concerning the request manage- 
ment processor P6U. 

The output Q of the flip flop B24 is connected to the 
signal rqst_SR, to the destination of the directory mainage- 
ment processor PGR. Its activation releases a request for 
service to this processor, which responds by the line 
grnt_SR, connected to the input R of the flip flop*B25.. The 
output Q of the .flip flop B25 is connected to one of the 
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inputs of a logic gate OU23. the output c£ thje logic gate 
OU23 is connected to the input en20 of the shift register 
SR20. 

The logic assembly B24 and B25 constitute a resynchron-* 
ization logic RESYNC^S between asynchronous units. Its opera- 
tion is as follows: 

A service request rqst_SR to the directory management 
processor PGR is made by activation of the flip flops B24 and 
325 through the logic gate OU22, which authorizes two origins 
of activation. The logic proper to the directory management 
processor PGR assures a response grnt_SR in an indeterminate 
timer which resets the flip flop B25 to zero. The flip flop 
B24 maintains its request until it is reset to zero by activa- 
tion of its input "5. In return, the directory meinagement 
processor PGR deactivates its line grnt_SR: the resynchroni- 
zation logic is ready to operate for a next request for serv- 
ice. The output Q of the flip flop B25 serves to block the 
phase distributor by action on its input en20 via the logic 
gate OU23: the resetting to zero of the flip flop B25 frees 
the phase distributor which furnishes the first phase 921 
upon the next active transition of the general clock h, 
connected to the input clk20 of the phase distributor. 

The line grnt^SR is also connected to' the validation 
buffers BV20, BV25 and BV21, BV22 which open access respect- 
ively to the management directory RG and to the cache memory 
MC. 

If the flip flop B20 is active, then the transaction 
in progress- is a purge of the block requested .by the snooper 
processor PE via the waiting line FIFO. The .output Q of the 
flip flop B20 is connected to the validation buffers BV24. 
These buffers receive on their input the output of a register 
REG20. The output Q of the flip flop B20 is connected to one 
of the two inputs of a logic gate 0U21, which receives on its 
other input the output of the differentiator circuit D20. 
The output, of the logic gate 0U21 is connected to the input 
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Ioad20 of the register REG20 and to the input read20 of the 
waiting line FIFO. 

Thus, the activation of the flip" flop B20 provokes: 

1. Initialization of the phase distributor, 

2. A request for access to the management 
directory R6 and to the cache memory MC, 

3. The loading' of the element at the head *of the 
waiting line FIFO in the register REG20 and advancing of the 
waiting ling, 

4. The validation of the buffer BV24: the bus 
adr_x contains the address of the block to be purged. The 
nature of the operation (carried out) will be relocated from 
the combination of the bits v and m (signal maj). 

If the flip flop B21 is active, then the transaction 
in progress comes from an information defectin the cache 
memory MC* The output Q of the flip flop B21 is connected to 
the validation buvvers BV23, These buffers receive on their 
input the information originating from the request management 
processor PGU. 

Thus, the activation of the flip flop B21 provokes: 

1. The initialization of the phase distributor, 

2. A request for access to the management 
directory RG and to the cache memory MC, 

3. The validation of the buffers BV23: the bus 
adr_x contains the address of .the block which has provoked 
the information defectin the cache memory MC, and the nature 
of the request: -reading or writing, shared data of 
non-shared data (lines fcjas)^ 

The field "frame" df the bus adr_x is .connected to the 
address lines adr_RG of the management directory through the 
Validation buffers BV20. the outputs Q of the flip flop B26 
and B27 are respectively connected to the lines r_RG and w_RG 
of the management directory through the validation . buffers 
BV20. The flip flop B26 receives on its input S the output 
of the logic gate OU22 and on its input R the phase 022 
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inverted, coining from the phase distributor. The flip flop 
B27 receives on its input S the output of a logic gate NOU22 
and on input R the output of a loghic gatd NOa23. The logic 
gate tIOU22 receives on its two inputs the respective outputs 
of the logic gates BT25 and ET26, themselves receiving on 
their inputs, for the gate BT25 the phase 022 and the signal 
maj, and for the gate ET26 the phase 926 and the signal ma j . 
The logic gate NOU23 receives on its two inputs the respec- 
tive outputs of the logic gates ET27 and ET28 , themeselves 
receiving on their inputs, for the gate ET27 the phase 623 
and the signalmaj inverted, and for the gate ET28 -the phase 
027 and the signal maj inverted. 

The field tag of the bus adr_x is connected to the 
field tag of the lines data_RG through the validation buffers 
BV25. These latter receive on their validation input the 
output of a logic gate OU24 which receives on its inputs the 
signal grnt_SR and the output Q from the flip flop B27. 

The inputs D of the flip flops B28 and B29 are respect- 
ively connected to the bit validation lines v and bit modifi- 
cation lines m of the bus data_RG. The clock inputs of these 
flip flops iB28 and B29 are connected to the phase 022 of the 
phase distributor. The output Q of the flip flop B28 and the 
output Q of the flip flop B29 are connected to two inputs of 
a logic gate ET29., which furnishes on its output the signal 
maj. A logic gate OU25 receives on. its inputs the signal maj 
and the output Q of a flip flop B30. The output of the logic 
gate OU25 is connected to the selection input sel20 of a 
multiplexer MUX20,^ which receives on its two data inputs the 
output Q of the flip flop B28 (bit v) and the constant 0, the 
constant zero being selected when the selection input sel20 
is at the logic state of one. The output of the multiplexer 
MDX20 is connected to the bit validation line v of the bus 
adr_x. The bit waiting line -a- of the bus adr_x is forced 
to the logic state zero. The modification bit -m- is connect- 
ed to the reading line r_adr_x of the bus adr_x. 
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The logic assembly described above constitutes the 
access logic and operation logic of the management directory 
^* o peration is as follows: the activation- of the 

signal grnt_SR, authorizing access to the management director 
and to the cache memory, validates the validation buffers 
BV20. The reading of the descriptor concerned is controlled 
from the beginning of the access authorization until the 
arrival of the phase 922, the storage instant of the bits -v- 
and -ra- in the flip flops B28 and B29 . The combined state Of 
these two bits produces, across the logic gate BT29 , the 
signal maj which conditions the succeeding operations-. 

First Case; . maj = 1. This case is produced when 
the validation bit is equal to 1 and the modification bit is 
equal to 1. This case corresponds either to a request to 
purge the block from the snooper processor PE, or to an 
information defect established by the request management 
processor P60 upon placement of the block occupied and 
modified: in the two cases, the block concerned must be 
written into the central memory RAM. 

TO this effect, the field -cadre- of the bus adr_x is 
connected to the lines adr_MC via the validation buffers 
BV21. The outputs Q of the flip flops B30 and B31 are respec- 
tively connected to the lines r_MC and wjJC of the cache mem- 
ory MC through thp validation buffers BV21. The line bl^ is 
forced to zero via one of the. validation buffers BV21. The 
data lines data__MC of the cache memory are connected, via the 
bidirectional validation buffers BV22 to the inputs of the 
shift register RDP and to tb'e outputs of the validation buf- 
fers BV26, which receive on their inputs the. output lines of 
the shif t reg ister RDP. The buffers BV22 are validated by 
the line grnt^SR, and their direction of validation controll- 
ed by the output Q of the flip flop B31. The flip flop B 30 
receives on its inputs § and R respectively the phases 921 
and 923, inverted, coming from the phase distributor. the 
flip flop B31 receives on its inputs S and- R respectively the 
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outputs of the logic gates ET32 and ET33. The logic gate 
ET32 receives on its inputs the phase 926 and the signal maj 
inverted, the logic gate ET33 the phase 827 and the signal 
maj inverted. A logic gate NET20 receives on its inputs the 
phase d23 and the signal maj. The output of . a logic gate 
ET35 controls the validation buffers BV26, and receives on 
its two inputs respectively the signal maj inverted and the 
output Q of the flip flop Q31. The output of the logic gate 
NET20 is connected to the signal load21 of the shift register 
RDP and to the input S of a flip flop B32. The input R of 
this flip flop B32 receives the signal f in_transf ertjoaaj com- 
ing from the logic TFR associated with the shift register 
RDP. The output Q of the flip flop B32 is connected to one 
of the inputs of the logic gate Q023. 

The logic described above permits purging a block from 
the cache memory. Its operation is as follows: 

In parallel with the access to the management direc- 
tory RG, a reading of the cache memory MC is activated by the 
line r_MCf coming from the flip flop B30, during the phases 
Q21 and 322. At the output of this reading, the data read, 
representing the block to be downloaded, are presented at the 
input of the shift register RDM. The activation of the sig- 
nal maj, the state of which is known from the beginning of 
the phase 822, provokes: 

1. The invalidation of the block in the manage- 
ment directory: the input sel20 of the multiplexer MUX20 
being at the logic .state one, the value zero is forced on the 
validation bit, the descriptor being written in the manage- 
ment directory RG with the activation of the. -signal w_RG, 
controlled by the flip flop B27 during the cycle Q22 , 

2. The loading of the shift register RDH and the 
activation of the transfer, upon passage of the phase 622 to 
the phase 923, 

3 . The placement at the logic state one of the 
flip flop B32, which comes to block the phase distributor on 
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the phase 023 until the end of the transfer, signalled by the 
signal f in__transf ert_maj coming from the logic TFR, 

4. The liberation of the management directory R6 
and of the cache memory MC by resetting to zero the flip flop 
B24 on the phase 823. 

Thus, the access to the directory if freed (therefor 
accessible for the snooper processor PE) , the transfer of the 
operation is in progress, and the phase distributor blocked 
on ©23. 

When the transfer is terminated, the block is in 
waiting state in the shift register RDM and it is necessary 
to activate the management processor of the parallel link in 
order that the writing into central memory RAM will be 
effective. 

To this effect, the flip flop B33 is connected by its 
output Q to the service request line rgst_SP at the destina- 
tion of the parallel link management processor PGP. The out- 
put Q is connected to one of the inputs of the logic gate 
CW23, the input 3 to the phase 824 inverted from the phase 
distributor and the input to the signal acir_SP. The bus 

is connected to the bus adr_bloc_SP of the parallel 
link management processor PGP. One of the lines of the bus 
adr^blbc_SP receives the signal maj in order to indicate the 
nature of the request: position open. 

After the liberat ing ot the phase distributor by the 
signal f in_transf ert_maj , the next active transition of the 
general clock h causes the passage of the phase 823 to the 
phase 624. The phase 824 provokes a service request from the 
parallel link management processor PGP (activation of the 
line rqst^SP) and the blocking of the phase distributor until 
discharge of the request by the signal ack^SP. -At this 
moment, the writing in operation will be effectively achieved 
by the parallel link management processor PGP. The phase 
distributor, on the next active transition of the clock h, 
comes to pass from the phase 824 to the phase 825. 
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The updating of the central memory RAM is terminated: 
the flip flop B20 is set to zero by activation of its input R 
by the phase 025 inverted. The flip flop B212 is set to one 
by activation of its input S by the phase 025, conditionned 
by the signal roaj by means of the logic gate ET22 via the 
logic gate NDU22. In case of information defect in the cache 
memory, the purge of a block only constitutes the first part 
of the request: the request rqst_as is always present, but 
the liberation of the flip flop B21 is going to permit event- 
ual consideration of requests for operation while waiting in 
the waiting line FIFO. When the waiting line FIFO- is empty 
(empty = 0), the whole cycle, described previously is reproduc- 
ed, with, this time, the validation bit at zero.' On is then 
in the following case: 

Second Case: maj - 0. This case arises when the valida- 
tion bit is equal to zero (by following perhaps., a purge of a 
block) or when the 5^alidation bit is equal to one, but the 
modification bit is equal to zero:, the current copy of this 
block is already in the central memory RAM. 

Thus, the request for service rqst^SR comes to cause 
in return an accord for access to the management directory RG 
and to the cache memory MC with reading of the descriptor, 
storing the bits m and v, generation of the signal maj, re-- 
writing of the descriptor with v « 0 (maj is at the logic 
state zero, the flip flop B31 is at the logic state zero and 
its output Q has a value of one, which imposes a logic signal 
at the state one oii the input sel20 of the multiplexer MOX20 
and therefor forces the constant 0) and thd freeing of access 
to the mauiagement directory RG and the cache. memory MC. The 
operations are effective upon activation of the phase 02 3. 

The block request must now be read in the central mem- 
ory RAA. To this effect, the 'signal maj inverted is received 
at one of the inputs of a logic gate NET21, which receives on 
its other input the phase 025, The output of the logic gate 
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NET21 is connected to inputs s" of the flip flops B34 and B35. 
The flip fl op B34 receives on its input R the signal 
£in_reception coining from the transf r logic TFR. Its output 
Q is connected to the differentiator circuit D21. This cir- 
cuit is connected to the logic gate 0022. the input R of the 
flip flop B35 is connected to the signal grivt_SR and the out- 
put 5 on one of the inputs of the logic gate 0023. 

The reading of a block in the central memory RAM and 
its transfer into the cache memory are realized in the follow- 
ing manner: the transition of the phase 823 to the phase 624 
unlatches a service request to the parallel link management 
processor by activation of the line rqst_SP, coming from the 
flip flop B33. The type of operation is this time a reading 
(r_adr_x » 0) or a writing (w_adr_x = 0) and the bus adr_x 
furnishes the address of the block requested on the bus 
adrj5loc__SP. The phase distributor is blocked until arrival 
of the release signal aclc_SP: the reading or writing request 
has been made to the central memory RAM by the parallel con- 
nection management processor PGP, and the block has been at 
the same time validated and marked "in waiting" by this, same 
processor. the transfer is then in progress, from the cent- 
ral memory RAM to the shift register RDP. 

The phase distributor, freed, furnishes then the phase 
825.. This phase, by the intermediary of the logic gate NET21 
which receives on its other input the signal maj inverted, 
comes to set to the logic state one, the flip flops B34 and 
B35. The flip f ibp • B35 blocks the phase distributor. The 
flip flop B34 is reset to zero upon arrival of' the block 
request in the register RDP, signalled by the activation of 
the line f in_reception coming from the transfer logic TFR, 
which releases the activation of the differentiator circuit 
D21 and an access demand to the management directory RG and 
the cache memory MC, by the activation of the line rqst_SR 
from the flip flop B24. • " 
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The access accord, signalled by the line grnt_SR, 
frees the phase distributor by reseting the flip flops B25 
and B35 to zero, and opening the access to the management dir- 
ectory RG and to the cache memory MC. The phase distributor, 
upon the next active transition of the general clock hr furn- 
ishes the phase 026. 

The flip flop B27, connected to the signal w_RC of the 
management directory, is activated by Q26 and Q27 , which per- 
mits providing the operation of the memagement directory 
with: 

field tag of the bus data_RG = field tag of the 

bus adr_x, 

validation bit v » 1 (maj » 0 and the output Q of 
the flip flop B31 is at zero: the multiplexer MUX20 allows v 
to pass, which has been forced to one by the parallel connec- 
tion management processor PGP, or already re^et to zero by 
the snooper, processor PE) , 

modification bit m »» state of the line r_adr_x of 
the btis adr_x (writing request causes m » 1, reading request 
m » 0) , 

awaiting transfer bit a » 0, 
bit f « 0. 

The flip flop B31 activates the signal w_MC of the 
cache memory MC of 626 and 927, which permits writing trhe 
contents of the shift register RDM (the buffers BV26 are 
validated by the output Q of the flip flop B31 and the signal 
maj inverted) in the good placement (field -cadre- of the bus 
adr_x connected to the bus adr_MC) of the cache memory , via 
the validation buffers BV22, controlled in the direction of 
tralsfer by the flip flop B31. 

Upon arrival of the phase 627, the operation of the 
management directory and of the cache memory' is terminated. 
The arrival of the phase 827 provokes the resetting of the 
flip flop B24 to zero, which frees the access to the manage- 
ment directory RG and. to the cache memory MC and the 
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activation of the output of the logic gate ET23, resetting 
the flip flop B21 to zero and activating the signal ackJJS to 
the destination of the request management processor PGO: the 
request is terminated. 

In addition, the snooper processor PE is charged with 
maintaining the coherence of the data in the cache memory MC. 
By way of example, there is ^ shown in FIGURE . 15 an architec- 
ture of this processor. This is released, at each active 
transition of the signal valid from the common bus BOSA. 

If the signal valid is activated by a parallel connec- 
tion management processor PGP other that that associated with 
the snooper processor PE, then the task of this latter is as 
follows, as a function of the request: 

request for reading of the block of oon- shared data, 
or request for writing of the operation of the block: 
nothing, 

request for reading of the block of shared data: 
block absent: nothing, 

block present and not modified: nothing, 
block present and modified: request for purge of 
the block (that this block be present or in the course of 
transfer from the central memory RAM to the cache menory MC, 
state signalled by the bit -s- of the descriptor), 

request for writing of the block of shared data: 
block absent: nothing, 

block present unmodified: invalidate the block, 
block present modified: request for purge of the 
block (same remark as above), 

request for informative writing of the block: 
block absent: nothing, 

block present unmodified: invalidate the block, 
block present modified: impossible case, 
if a request for informative writing is in wait- 
ing on this 3ame block, it is treuisformed into a request for 
writing of the block, since this block has been invalidated: 
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to this effect, the request in progress is annulled and a 
release is sent to the request management processor PGU. 
This latter consults the directory RG amd comes to find the 
block absent: a request is sent to the management processor 
PCS . This operation is taken into consideration by the para- 
llel ioaanagement processor PGP. 

If the signal valid 'is activated by the parallel con- 
nection management processor PGP associated with the snooper 
processor, then the task of the latter is as follows, func- 
tion of the type of request (termed local request): 

request for reading of the block of non- shared 
data or request for writing of the operation of this block: 
nothing, 

request for reading of the block of. shared data: 
mark the block valid, while waiting for transfer and modified 
(v = m = s = l), 

request for informative writing of the block: 
mark the block "modified" (m « 1) . 

To assure the functions described above, the snooper 
processor PE comprises a phasae distributor DP_E, comprising 
a shift register SR40 and a flip flop B40. The output of a 
differentiator circuit D40 is connected to the input S of the 
flip flop B40 and the input clear 40 of the register SR40 , as 
well as the input S of a flip flop B.41. The output Q of the 
flip flop B40 is connected to. the input serial_in40 of the 
register SR40. The input clk40 of the register SR40 receives 
the signal h from the general clock and the validation input 
enUr is connected to the output Q of the flip. flop B43. The 
phase B41, inverted, coming from the register .SR'40 is connectr 
ed to the input R of the flip flop B40. 

The operation' of the phase distributor is in accord- 
ance with the description given for the" request management 
processor PGU. 

The signal valid of the bus BUSA is connected to the 
input of -the differentiator circuit D40 and to the validation 
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input en41 of a decoder DEC 41. The flip flop R41 receives on 
its input R the output of the logic gate NOO40 . The output Q 
of the flip flop B41, to. the open collectorr is connected to 
the signal done of the bus BUSA. 

The signals valid and done assure the synchronization 
of the snooper processor PE with the other snooper processors 
of the multiprocessor system: the negative transition of the 
signal valid unlatches the differentiator circuit D40 which 
produces a pulse permitting activation of the phase distribut- 
or and sets the signal done at the logic state zero through 
the intermediate flip flop B41, The end of the wo.rk of the 
snooper is signalled by a change of state on the output of 
the logic gate NOU40, which produces the setting' of the logic 
state of the signal done to one through the intermediary of 
the flip flop B 41. 

The work o£ the snooper is a function of the nature of 
the request and, to this -effect, the field type of the bus 
BUSA is connected to the input o£ the decoder DEC41. The out- 
puts dnp and dmaj of the decoder DEC 41 are connected to the 
inputs of a logic gate OU40. The outputs di , de, dei of the 
decoder DBC41 are connected to the inputs of a logic gate 
0041, the outputs de and dei being both connected to the 
inputs ofr a logic gate OU42. The output of the logic gate 
OU40 is connected to one of the inputs of the logic gate 
NOU40 by the intermediary of a iogic gate ET40, which 
receives on its other input the signal Q41. The output of 
the logic gate 0U41 .is connected respectively to one of the 
inputs of the logic gates ET42, ET43, which receive on their 
other input respectively the phase signal 341 and 344. 

The outputs of the logic gates ET42 and ET43 are con- 
nected respectively to the inputs S* and R of a flip flop B44. 
The output of the logic gate ET42 is also 'connected to the 
inputs S of the flip flops B42 and B43. A logic gate N0U41 
receives on its inputs the phase signal 345 and the output of 
a logic gate ET45, which receives on its two inputs the phase 
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9-44 and the inverted output of a logic gate 0043. The output 
of the logic gate N0041 is connected to the input R of the 
flip flop B42. The output ^ of the flip flop B42 is connect- 
ed to the signal rqst_PR and the signal grnt_PR is delivered 
to the input R of the flip flop B43, to the validation inputs 
of the passage buffers BV40 , and to one of the inputs of the 
logic gate* 0044. The logitJ gate 0044 receives on its other 
input the output Q of a flip flop B45, and its output valid- 
ates the passage buffers BV41. The output of the logic gate 
ET45 is also connected to one of the inputs of the logic gate 
1IOO40 which receives on another input the phase signal 345. 
the output of the logic gate 0042 is connected to the inputs 
of the logic gates BT46 and ET47, which receive also on their 
inputs the output of a logic gate 0043 and respectively the 
phases ^44 and &45 . 

The output "q of the flip flop B44 delivers the signal 
f_RG via a buffer BV40, the" output Q of the flip flop B45 
delivers the signal w_RG via a buffer BV40. The field cadre 
of the common bus BUSA. is connected to bus adr_RG via a 
buffer BV40. The bus data_RG is connecte4d to the outputs of 
the validation buffers BV41 and to the input of the register 
REG40, which receives on its input load40 the phase signal 
643. The output of the register REG40 is connected, for the 
part tag, to the inputs of the buffers BV41 and to one of the 
inputs of a comparator COMP40. The comparator COMP40 
receives on its other input the part tag from the bus BUSA. 

The validation bit v, issued by the register REG40, 
connected to the comparator COMP40 has opposite the constant 
value 1. The bits v, a, m, f in the output , of the register 
REG40 are connected respectively to one of the inputs of the 
multiplexers MOX40, M0X4I, M0X42, MOX43. The outputs of 
these multiplexers furnish the state of these same bits to 
the inputs of the buffers BV41. The multiplexer MOX40 
receives on its othej; inputs the constants zero and- one, the 
inputs sel40 are connected to the output of a logic gate ET48 
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and to a signal dlle* The multiplexer MaX41 receives on its 
other input the constant one, selected when its input. sel 41, 
receiving a signal dlle from the management processor PGP, is 
at the logic state of one. The multiplexer MOX42 receives on 
its other inputs the constant 1 and the signal r_adrbloc_SR, 
on its inputs sel42 receiving the signals dlei and dlle com- 
ing from the management processor P6P« The multiplexer MUX43 
receives on its other input the constant one, selected when 
its input sel43, connected to the output of a logic gate 
ET49, is at logic state one. 

The logic gate ET49 receives on its inputs the output 
eg 40 of the comparator COMP40, the signal f inverted and the 
signal m. The logic gate ET48 receives on its inputs the out- 
put eg40, the signal m inverted, the signal dlei and the out- 
put of the logic gate 0042. The logic gate 0043 receives on 
one of its inputs the signal eg 40, and on its other input the 
signal dlle. The output of the logic gate ET49 is also con- 
nected to the input load41 of the waiting line FIFO, already 
written in the series link management, processor PGS. The 
fields cadre and tag of the bus BOSA are connected to the 
inputs of the waiting line FIFO. . The signals dlei, dlle and 
r_adrblock_SP come from the parallel link management 
processor PGP,, which receives also the signal dei of the 
decoder DEC41. 

The operation of the assembly is as follows: the acti- 
vation signal valid initializes the phase distributor DP_E 
and validates the decoder DEC41 which produces the activation 
of an output. as a function of the information type coding the 
nature of the demand in progress on the common .bus BOSA. The 
active output may be: 

dnp: request for reading of non**shared data. The 
signal done is deactivated upon appearance of the phase 641; 

- dmaj: writing request for operation of the block. 
The signal done is deactivated at phase 941; 

- dl: request for reading the block; 
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- de: request for writing the block; 

- dei: request for informative writing. 

Theswe three cases necessitate an. access in reading to 
the directory RG, and the two latter an eventual re-writing 
of the directory. to this effect, a request for access is 
sent to the directory management processor PGR by the flip 
flop B42 (signal rqst_PR) the flip flop B43 blinding the 
phase distributor until access agreement, signified by the 
signal grnt_PR. The reading thus carried out of ©41 and 644 
(flip flop B44) and the eventual writing of Q44 to Q45 (flip 
flop B45) with storage of the descriptor in the, register 
REG40 from the phase 843. If the block is absent from the 
cache memory MC (eg40 =0), then the signal done is deactivat- 
ed on the phase 644. If the block is present in the cache 
(eg40 = 1) , then: 

- if m « 1, a request to purge is activated (activa- 
tion, logic gate ET49) on the condition that this block is not 
already in the line (f - 0); the only bit modified is f, set 
to one, at the time of re-writing of the "descriptor, 

- if m » Or the block is invalidated by the multiplex- 
er MaX40 (activation of the logic gate ET4'8) , 

- if the request is local (dlle or diei active) then: 

1) in . case of reading or writing, the bits v and 
a are set to 1 and me is set to 0 (reading) or 1 (writing) 
(state of the signal r_adrbloc_SP) , 

in the case of informative writing, the bit m is 

forced to 1. 

In these : latter cases which require a re-writing in 
the management directory RC, the signal done is deactivated 
on the phase 845. 

The parallel link management processor PGP, an example 
of which is shown in FIGURE 16, is charged with requesting 
the common bus BUSA and providing the transaction requested, 
either by the request management processor PGO, or by the 
series link management processor PGS. 
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A request coming from the request management processor 
PGU can only be an informative writing request. A request 
coming- from the series link management processor PGS is 
either a request for reading or writing of the block, or a 
request for the current setting of the block. 

The parallel link management processor PGP comprises a 
flip flop B60, connected -through its data input D to the 
signal rqst^UP. A flip flop B61 is connected through its 
data input D to the signal rqit^SP. the outputs Q and Q of a 
flip flop B62 are connected respectively to the clock inputs 
of flip flop B62S B60 and B61. theoutput Q of the .flip flop 
B62 is looped around to its data input. The outputs p of the 
flip flops B60 and B61 are connected to the inputs of a logic 
gate OO60 . The output of the logic gate OU60 is connected, 
first, to a differentiator circuit D60, and in an inverse man- 
ner, to an input of a logic gate ET60, which receives on its 
other input the general clock signal h. The output of the 
logic gate ET60 is connected to the clock input of the flip 
flop B62. The output of the differentiator circuit D60 is 
connected to the input S of a flip f lop B6 3. The output Q of 
the flip flop B63 delivers a signal ^iitl to the destination 
of the arbitrator AB, and its input R is connected to the out- 
put of a logic gate ET62. The signal grHH coming from the 
arbitrator AB is connected to the logic gates OU62 and NOU60. 
the l ogic gate OU62 receives on its other input the signal 
valid inverted, and its output is connected to the input of a 
differentiator circuit D61. The output of this circuit D61 
is connected to one of the inputs of the logic gate BT62 and 
to the input S of a flip flop B64. The output Q of this flip 
flop B64 is connected to the validation input of the passage 
buffers BV60 and in an inverted maner, across an inverter 
''^^^ ^" <*P«« collector I60', to the signal I^ITIZ. The signal 
done is connected to the input of a differentiator d62. the 
output of this circuit D62 is connected to the input R of the 
flip flop B64 and to one of the inputs of the logic gate 
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NOO60. The output o£ this logic gate NOU60 is connected to 
one of the input logic gates NET60 and NET61 , which receive 
on their other inputs respectively the outputs Q of the flip 
flops B60 and B61, The outputs Q of the flip flops B60 and 
B61 are also connected respectively to the validation inputs 
of the passage buffers BV61 and BV62 • The output of the 
logic gate NET60 is connected to the input S of the flip flop 
B60 and to one of the inputs of a logic gate ET63, which 
receives on its other . input the output of a differentiator 
circuit D63. The output of the logic gate ET63 delivers the 
signal aicq_UP to the management processor PGU. The output of 
the logic gate ET63 delivers the signal acq_UP to the manage- 
ment processor PGU. The output of the logic gate NET61 is 
connected to the input S of the flip flop B61 and furnishes 
the signal "acJc_SP • The bus adrjDloc_UP is connected to the 
input of the validation buffers BV61 and to one of the inputs 
of a comparator COMP60 • • 

The bus adr_bloc_SP is connected to the input of the 
valiiiation buffers BV62. The outputs of the buf f ers .BV61 and 
BV62 are connected together and to the input of the 
validation buffer BV60 . The output of the buffer BV60 is 
connected to the common bus BOS A. The logic gates 0063 and 
0064 receive on their inputs respectively the outputs Q of 
the flip flops B60 and 861, the logic signal grhti and the 
signal maj for 0064. The output of the logic gate 0063 
delivers the signal dlei, and the output of the logic gate 
0064 the signal dlle. The other input of comparator COMP60 
receives the fields tag and cadre from the common .bus BOSA. 
The input en60 of the comparator COMP60 is connected to the 
output of the logic gate ET61, which receives on its inputs 
the signal grnti inverted, the- signal dei and the signal 
rqst^OP inverted. The output eg60 of the comparator COMP60 
is connected to the input of the differentiator circuit D63. 
The output of this circuit is also connected to the inputs R 
of the flip flops B60 and B61 and the other input of the 
logic gate ET62. 
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The operation of the assembly is as follows: 
The flip flops B61 and B62 associated with the logic gates 
ET60 and ET61 constitute a local arbitrator. This arbitrator 
examines alternatively the requests rqsE_OP and rqsI_SP/ and 
resends them to the arbitrator AB of the common bus BUSA by 
the signal rqsti . The access accoxfd is furnished by the val- 
idation of the signal grnti and cycle bus takes place upon 
deactivation of the signal' valid which .frees the arbitrator 
AB. The activation of the signal done frees the local 
arbitrator: the transaction is terminated. 

If the request comes from the series link management 
processor PGS, then the signals dlei and dlle indicate to the 
associated snooper processor PE the nature of the updating of 
the status bits of the block to be carried out in the direct- 
ory RG. 

if the request comes from the request management proc- 
essor PGOr the in case of detection of informative writing on 
the same block (signal del coming from the snooper processor 
PE) ^ an immediate liberation takes place: the request manage- 
ment processor PGO, after consultation of the directory (the 
block has been invalidated) will direct its request to the 
series link management processor PGS . 

The memory management processor PGM^ one example of 
which is shown in FIGURE 17, is charged with assuring the 
reading or writing of the block in the central memory RAM and 
to participate in maintaining the coherence of the informa- 
tion in the various cache memories MC of the multiprocessor 
system . 

To this end, it comprises a differentiator circuit D80 
receiving on its input a signal valid and connected by its 
output to the. inputs S of the flip flops B80 and B81 as well 
as to the input clr_80 of a shift register SR80. On the put- 
put Q of the flip flop B80, with an open collector, is deliv- 
ered the signal done. The output Q of the flip flop B81 is 
connected to the input serial_in80 of the register SR80. 
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This flip flop B81 is connected by its input R to the output 
Q81, inverted, of the register SR80. The register SR80 
receives on its input clk80 the general clock signal h and 
its validation input "enffff is always active. The flip flop 
B8i and the shift register SR80 constitute a phase distribut- 
or PPJM. The signal valid is also delivered on the valida- 
tion input enSI of a decoder DEC80. This decoder is connect- 
ed by its data input to the part type of the common bus BOS A, 
and furnishes the signals dnp, dl, de, ei and maj. A read- 
write memory RAMFG of a size of two bits (termed respectively 
ro and rw) receives on its address bus the fields tag and 
cadre of the common bus BOS A. The data bus of this memory, 
constituted of bits ro and rw, is connected by one part to a 
logic PAIiBO, and to a logic gate ET80, directly for rw, and 
inverted for ro. The logic PAL80is connected to the field 
type and receives the logic signals cl, r/w, s/n, raff and 
en82: the signal cl is emitted from a flip flop B82 , the 
signals r/w and s/n from an associative waiting line APIPO, 
the signal mff from a logic gate ET81, and the signal en82 
from a flip flop B83, which receives on its inputs S and R 
respectively the signals 082 and 081 inverted from the phase 
distributor DP_M. The logic PAL cables on its outputs ro-rw 
the following logic equations: dnp - 10; dl.raff » 10; dl.mff 
= 01; de = 01; maj.cl = 00; ei = 01 ; ma j .cl • s/n .rTw = 10; 
maj.cl.s/n.fTw = 01. The output of the logic gate ET80 is 
connected to the input D of a flip flop B84 which receives on 
its clock input the phase e82 . The output Q of this flip 
flop is connected to one of the inputs of the logic gate 
ET81, which receives on its other input the output of the 
logic gate OU80. The two inputs of this logic gate 0080 are 
connected to the outputs de and dl of the decoder DEC80.. The 
reading input r o£ the memory RAMFG is connected to the phase 
e81, and the writing input w to the output of a logic gate 
ET82. The logic gate ET82 receives on its inputs the signal 
.©83 and the output of a logic gate ET83, the inputs of which 
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are connected to the signal s/n and the phase 9-87 • The out- 
put dnp of the decoder DEC80 is connected to the logic gates 
ET84 and ET85, which receive on their other input respective- 
ly the phases 681 and 885. The signal s/n is also delivered 
to the logic gates ET86 and ET87 which receive on their other 
input respectively the phases 986 and 690 • The output mff of 
the logic gate BT81 is also connected to a logic gate ET88, 
which receives on its other 'input the phase 983, and in an in- 
verted manner to the logic gates ET89 and ET90 which receive 
on their other input respectively the phases -983 and 987 • 
The output of the logic gate ET88 is connected to the input 
wff of the line APIPO. The outputs of the logic gates ET84, 
ET86 and ET89 are connected to the, inputs of a logic gate 
0U81, the output of which is connected to the input S of a 
flip flop B85, The outputs of the logic gates ET85, ETS? , 
ET90 are connected to the inputs of a logic gate 0082 of 
which the output is connected to the input R of the flip flop 
R85 . The signal s/n is also, connected, in an inverted man- 
ner, to a logic gate ET91 which receives on its other input 
the phase 089. The output of the logic gate ET91 is connect- 
ed to the input of a logic gate NOO80 which receives on its 
other input the output of the logic gate 0082. The output of 
the logic gate iSfOOSO is connected to the input R of the flip 
flop B80. The output maj of the decoder DEC80 is connected 
to the input of the logic gates ET92, ET93r ET94, BT95 , which 
receive on their other input respectively the phases 681/ 
©85, 685, 691. The outputs of the logic gates ET92 and ET93 
inverted are connected respectively to the inputs S and R of 
a flip flop B86, and those of logic gates ET94 and ET95 to 
the inputs S and R of a flip flop B82 . The output Q of the 
flip flop B82 produces the logic signal cl also delivered to 
the input cff of the line AFIFO and ^to the control input 
sel80 of a multiplexer Max80 . The part tag, cadre of the 
common bus BOSA is connected to the da.ta input of the line 
AEIFO and to one of the data inputs of the multiplexer MUX 80 . 
The output dl of the decoder DBC80 is also connected to one 
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of the data inputs of the line AFIFO in order to produce the 
reading /writing signal 1/e. The data output of the lint 
AFIFO is connected to the other input of the multiplexer 
MUX80 . The output of the multiplexer MOX80 is connected to 
the address bus of the central memory RAM for the part 
tag. cadre, and to the inputs of the decoders DBC81 and DEC82 
for the part champ cpu . The output Q of the flip flop B86 is 
connected to the writing input of the central memory RAM and 
to the input en84 of the decoder DEC82 • The output Q of the 
flip flop B85r slightly retarded, is connected to the reading 
input of the central memory RAM and to one of the inputs of a 
logic gate ET96, which receives on its other input the output 
of the logic gate OU82*. The output of hthe logic gate ET96 
is connected to the input en83 of the decoder DEC81. The out- 
put j of the decoder DEC81 is connected to the validation 
input of the passage buffers of the memory shift register 
RDM j and the output j of the decoder DEC82 to the loading 
input of said memory shift registerRDM^ . 

The operation of . this assembly is as follows: 
The activation of the signal valid provokes the un- 
latching of the phase distributor DP_Hr and the validation of 
the decoder DEC80, which comes to permit the determination of 
the nature of the request. The phase 081 is used for reading 
the state of the bits corresponding to the block requested in 
the memory RAMPG, and the combination ro.rw is stored in the 
flip flop B84. A first writing takes place in the memory 
RAMPG on the phase 083 which permits updating the status 
bits. Their value is furnished by the logic^ PAL80 and per- 
mits obtaining the following chains: 

- in case of request for a block of non-shared data 
(dnp) then regardless of the state of the bits ro.rw (rw is 
forced to zero), the state 10 is forced ("block broadcast in 
reading") ; 
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- in case of request Eor a block for reading (dl) or 
writing (de)r if ro.rw = 01, then the request is placed in 
the waiting line on the phase 083 and the state 01 is forced 
(in fact this is the preceding state), if not the state 10 is 
forced in case of reading ("block broadcast in reading") and 
the state 01 is forced in case of writing ("block broadcast 
in writing") ; 

- in case of request for updating (maj), the state 00 
is forced ("block not broadcast")* In these various cases, a 
reading or a writing into the central memory RAM is carried 
out, tow-ard or away from the memory shift register RDM^ 
identified by the field cpu of the common bus BOSA. In the 
example chosen, the duration of the memory cycle RAM is 4 
periods of the general clock h. In the case of reading of 
non-shared data, the cycle is carried out from &81 to 885, in 
the other cases from 083 to 887. The writing is carried out 
from 981 to 985; 

- in case of informative writing, this does not cause 
movement of data, but forces the status bits to the value 01 
(state of departure is in this case forcedly 10); 

- in case of request for updating, a consultation of 
the waiting line AFIFO is systematically carried out. This 
consultation may cause the reading of a block, in the case 
where a central unit CPU is waiting in the line AFIFO for 
operating this block. 

The reading is carried out from 086 to 090 and the 
state of the bits is forced to 10 (request for a. reading) or 
01 (request for writing) . The end of all operation trans- 
lates by resetting to zero the flip flop B80 which activates 
the signal done. This deactivation may be produced on the 
phases 985, 987 or 991 according to the requested .operation/ 
or on 989 if the consultation of the line gives a negative 
result. 

The associative waiting line is not detailed. It con- 
stitutes in a conventional manner an associative store used 
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in the waiting line. the number of words o£ this memory is 
equal to the number of the central unit CPU of the multipro- 
cessor system. An internal "daisy chain" identifies on each 
phase 981 the next candidate word to a writing, which is 
produced on the phase 083 by the signal wff. The signal cff 
unlatches a comparison starting fromthe phase 085, the flip 
flops of the response memo):y having been reset to zero on the 
phase e84. The result of the comparison is propagated on the 
signal s/n (some/none) and the contents of the word concerned 
is available on the data output from the phase 986. This 
word is then invalid on the phase 890. 

In the architecture described above, the snooper 
processors PEj are urged to each address transfer on the 
common bus BUSA, with eventual consultation of their cache 
memory MCj. This consultation is most of the time useless 
(low probability of the presence of the address of the block 
corresponding to the transfer, in the cache memories). 

It should be .noted that the memory management process- 
or PGM keeps • the state ' bits of the blocks and makes possible 
a central management for maintaining coherence. To this 
effect, one can add to the architecture described above 
(FIGURE 10) a parallel synchronization bus operating accord- 
ing to the same algorithm as the synchronization bus SYNCHRO 
of the variation which is described hereafter. The snooper 
processors are no longer then properly spoken of a snoopers 
(since connected on the synchronization bus and not on the 
common bus BOS A) , and are designed as coherence maintenance 
processors (PMCj for the variation in FIGURE 18). Thus, 
the memory management processor PGM remains urged to each 
transfer on t^e common bus BOS A, but the coherence mainten- 
ance processors are urged by the processor PGM only when they 
are concerned by the transfer, 

FIGURE 18 presents a variation in which the coherence 
is maintained according to the principle discussed above. 
,This variation takes the general architecture of FIGURE 6/ 
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with the address of the blocks which transfer by the series 
links LSj. This system comprises a parallel synchro- 
nization bus SYNCHRO, of the same logic structure as the com- 
mon bus BaSA, but directed at the sole initiative of the 
memory management processor PGM. 

The structure of the central unit UC^ is conformed 
to that presented in FIGURE; -10 , with some modifications: 

- the structure of the cache memory MCj remains the 
same, as well as the structure of the management directory 
RGj, 

- the parallel management processor PGPj disappears, 
since the common bus BUSA no longer exists and the functions 
which- had been attributed to it are turned over to the series 
link management processor PGSj; 

- the snooper processor PE^ is replaced by a coher- 
ence maintainance processor PMCj which takes care of keep- 
ing the state bits of the blocks in the cache memory MCj in 
order to assure the coherence and which "is active at the sole 
initiative of the memory management processor PGMj via the 
synchronization bus SYNCHRO; 

- the request management processor PGO no longer knows 
a single partenary: the series link management processor 
PGSj, on which it reports all its requests; 

the series link management processor PGSj is 
charged with the transfer of the addresses and the data, con- 
forming to the principle described for the system of FIGURE 
6, each address being prefixed by the nature op the request; 

- the functionalities of the memory management process-, 
or PGM are those described in reference to FIGURE 17, its act- 
ivation no longer being assurred by the signal valid", which 
disappears (since previously associated with the common bus 
BUSA), but by the arbitrator ABM described in the system of ' 
FIGURE 6, which serializes the requests for service which 
move by the series links* The memory RAMFG is also constitut- 
ed by a supplementary field cpu associated with the state bits 
ro.rw. 
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The general funct loaning of the embodiment shown in 
FIGURE 18 is as follows: 

Each request of the treatment processor CPU activates 
the request management processor PGUj with the reading or 
writing indication and code or data. This processor requires 
an access to the management directory RGj after the direct- 
ory management processor in comparison with the directory man- 
agement processor PGBj. The consultation of the directory 
leads to one of the following cases: 

- the block is present in the cache memory MCj, with • 
an unmodified state (m = 0)': if the request is fqr reading, 
the information requested is extracted from the cache memory 
MCj and furnished to the treatment processor CPUj. If 
the request is for writing, then an informative writing 
request ei is transmitted to the series link management pro- 
cessor PGSj; . 

- the block is present in the cache mem.ory MCj, with 
a modified state (m « 1); the request rea:ding or writing, is 
satisfied? 

- the block is. absent from the cache memory MC^; a 
block request for reading 'or writing is transmitted to the 
series link management processor PGSj. 

Thus, the requests made to the series link management 
processor may be: a request for reading non-shared data 
(code): dnp, a request for reading the block: dl, a request 
for reading of the block in order to carry out there a 
writing: de, a request for informative writing: ei. 

At these various states, it is necessary to add the 
state maj corresponding to the purging of a block, either at 
the request of the coherence maintenance processor PMCj, or 
for liberating a placement of a block in the cache memory. 
"The* addresses thus prefixed move by the series links LS^, 
and conformingly to the principle set out during the descrip- 
tion of the architecture of FIGURE 6, urge the arbitrator ABM 
when: 
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in case of reading a block, the address is 
transmitted, 

- in case of writing a bXoclc, the address and the data 
are transmitted. 

These requests are treated sequentially by the memory 
management processor PGM, of the same general structure as 
that described in FIGURE 17. Their treatment is as follows: 

1/ dnp: request for non-shared data. The block is 
transmitted and takes the state ro.rw = 10. 

2/ dl: request for reading of a block. 

If the block Is in the non-broadcast state (ro.rw = 
00) or -broadcast in reading" (ro.rw = lO), it is transmitted 
and takes or keeps the state ro.rw = 01. 

If the block is in the state of broadcast in writing, 
the request is sent into the »*aiting line AFIPO. The memory 
management processor. PRM then finds in the field cpu of the 
memory RAMFG the address of the cache memory MC^ which con- 
tains the current version of the requested block^ A request 
for purging is then eniitted on the synchronization bus 
SYNCHRO, to the sole destination of the coherence maintenance 
processor pmc^ associated" with the cache memory MC. con- 
cerned. This request may be qualified by the addressed 
command . 

One wil note that the coherence maintenance processor 
PMC^ has not consulted the associated management directory 
RG. since the memory management processor PGM knows of the 
fact that it is the only possessor of the current copy. Its 
role consists simply of removing the request and depositing 
it in the associated file FIFO^^. 

3/ de: request for reading of a block in order to 
make, a writing . 

-If the block is in the "non-broadcast" state (ro.rw = 
00), it is transmitted and takes the "broadcast in writing" 
state- (ro.rw = 01) . 

If the block is in the "broadcast in reading" state 
(ro.rw « 10), then the manory management processor emits a 



73 



universal block invalidation command, then transmits the bloc 
with the state "broadcast in writing" (ro.rw « 01>. the univ- 
ersal command provokes the activation of all of the coherence 
maintenance processors PMC^ which execute strictly the same 
operations as those described for the system of FIGURE 10. 

If the block is in the "broadcast in writing" state, 
the request is sent to thta waiting line APIPO. As before, 
the memory manistgement processor PGM sends a command addressed 
to the only possessor of the current copy. 

4/ maj: request for writing of a block after a purge. 
The operating algorithm is in this case strictly the 
same as that described in reference to FIGORE 17 for the PGM 
processor • 

It will be. noted that the problem of discharge of the 
writing naturally found its solution in this embodiment by art 
addressed command for discharge. 

5/ ei: informative writing. 

This case is treated directly on the coioamun bus BUSA 
in the architecture presented in FIGORE 10. In the 
embodiment provided here, and in order ot assure the 
synchronization, this operation is controlled by the memory 
management processor PGM, 

If the block is in the state "broadcast in reading", 
then a - command, at the time universal and addressed, is sent 
out: addressed in the sense that the coherence maintenance 
processor PMCj concerned notes the discharge of the request 
for informative writing and passes the block concerned into 
the state "modified" in the management directory RGjf 
universal in the sense that all the other processors PMC^ 
must invalidate this block in their directory. 

The block . in the -broadcast in writing" state indi- 
cates that an informative writing request has been treated on 
this same block during the time waiting for treatment of the 
request. In this case, the informative writing request is 
transformed into a writing request de , and follows the same 
treatment as in the corresponding case of writing. 
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The parallel synchronization bus SYNCHRO is charged 
with broadcasting the block addresses prefixed by a processor 
number and a type of request, to be about 30 to 40 bits 
according to the characteristics of the multiprocessor. 
These data are moreover transmitted in a unidirectional man- 
ner. Their transfer may advantageously be made there by a 
series link. Their transfer rate is less critical than for 
the block, and simplified solutions may therefor be envision- 
ed, for example by the bias of "TAXI" circuits made by the 
A.M*D. Company. 

PIGORB 19 shows a partial block diagram of ..one archi- 
tecture according to the invention, in which several central 
units UCj^... are Connected in clusters and share the same 
series link LS^^. In this goal, a local arbitrator ABLj^ 
associated with the cluster is charged with arbitrating the 
access conflicts by means of communication of block 
addresses, and sharing with the memory processor PGM, trans- 
ported for this purpose, a signal busyj^ indicating perman- 
ently the free state or occupied by the series link LS^. 
Means for coding and decoding of one at the head of the 
identification of the processor concerned at the interior of 
a cluster are associated with emission logic and reception of 
blocks of data. 

In the case where the address communication means for 
blocks are constituted by the . common bus BUSA, the operation 
is as follows: 

If the central unit CPUj^^j desires to make a trans- 
fer of a -block in the direction of the central memory RAM to 
a cache memory MCj^^j (case dnp, dl, de) or carry out an 
informative writing ei, then it requires access to the common 
bus B03A by the local arbitrator ABLj^, which propagates the 
request, its turn coming, to the arbitrator AB. The accord 
for access to the common bus BOSA is re- sent to the central 
unit CPUj^^j and the transfer is carried out in the manner 
described in reference to . FIGURB 10 • All blocks transmitted 
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in the direction of the central memory RAM to the cache mem- 
ory MCj^^. must then be identified, since the order of the 
requests ''is not respected of the fact of the possibility of 
being waiting in the line AFIPO of the memory management pro- 
cessor PGM. If the central , unit CPU^+j desires to make a 
transfer of a clock in the direction of the cache memory 
MCj^^j toward the central memory RAM (case maj)/. then it- 
requests first of the local arbitrator ABLj^ access to the 
series link LS^^. The local arbitrator ABLj^ and the mem- 
ory management processor PGM are both susceptible of authoriz- 
ing the series link LS^: the contention is avoided by sync- 
hronizing the modification, of the signal busy^ with the 
signal valid (the memory management processor PGM only activ- 
ates or reactivates a transfer during a memory transaction) . 
The occupation accord of the series link LSj^ l^ads the cen- 
tral unit CPOj^^j to transfer its block of information to 
the memory shift register RDMj^, then to request the local 
arbitrator ABLj^ for access to the common bus BOSA in order 
to carry out there the operation request, which is carried 
out according to the algorithm described in reference to 
FIGURE 10. The writing of the operation may cause a libera- 
tion of the block in the waiting line AFIPO of the memory man- 
agement processor PGM and request an occupied shift register 
RDM.. In this case, the transfer requested is delayed and 
chained with the transfer in progress. 

In the case where the block address communication 
means is the series links themselves, the operation is then 
identical to the preceding case for the preemption of the ser- 
ies link, and identical for the general operating algorithm 
to. that shown with reference to FIGURE 17. 

For example,. a reading request of the block emitted by 
the central unit CPO^^^necessitates first an access accord 
tb the series link LSj^, an accord given by the local arbit- 
rator ABLj^ in concert with the memory management processor 
PGM. The access accord causes the' transfer of the address of 
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the block requested on the series link LS^ which is immedi-- 
ately liberated: it is available for any other transaction 
if necessary. A writing request of a block follows the same 
protocol for access to the series link LSj^. 

In the architecture described with reference to 
PIGDRES 1 to 19, there exist as many memory shift registers 
RDMj as central units CPUj: one series link LSj being^ 
provided in a static manner' to a couple (RDM . CPU j) . 

If there must evidently be at least one series link 
LSj between a central unit and the central memory RAM, the 
number of shift registers RDM^ may be less. In effect r if 
tacc is the time for access to the central memory RAM, and 
ttfr the time for transfer of block, it is not possible to 
keep more than n » ttfr/tacc shift registers simultaneously 
occupied. For example', for tacc- = 100 ns and ttfr « 1200 ns, 
one obtains n » 12 . 

Tacc and ttfr are then the criteria characteristic of 
the performance of the multiprocessor system according to the 
invention and putting in place n memory shift registers 
RDMj is compatible with a greater number of series links 
LSj that the condition of interspersing a logic of the 
interconnection network type RI between registers and links, 
the allocation of a memory register RDMj to a series link 
LSj being achieved in a dynamic manner by the memory manage- 
ment pro cesisor PGM. 

Further, the central memory RAM will generally consti- 
tute m memory banks RAM^^, , . •RAMp, RAMjj^ arranged in para- 
llel, each memory bank comprising n shift registers RDMj 
connected by an interconnection network Rip to the assembly 
of series links LSj. It is then possible, under reserve 
that the addresses of the blocks be uniformly distributed in 
the memory bank« RAM^, to obtain a theoretical performance 
of m X n shift registers simultaneously active. The uniform 
distribution of the addresses is assurred by conventional 
mechanisms for interlacing the addresses. 
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In FIGURE 20a^ there is partially shown an architect 
ture conforming to the invention, comprising m memory banks 
RAM with n shift registers RDM^ per* memory bank and q 
central units OCj, Each memory bank is of the random 
access type provided with an input/output of data of a size 
corresponding to a block of information bi , this input/output 
being (as before for the memory RAM) connected by a parallel 
bus to the assembly of elementary registers RDM-^p. •RDMjp. 

The interconnection network is of a known structure 

("cross-bar," "delta," "banyan,"...). One will note that a 
multi-stage network is well suited in the case where the time 

for establishing ' a path is negligible before its time of 

occupation (the time of transfer of a block) and that: it only 

concerns one bit per link. 

The memory management processor PGM is adapted to be 

assured of the dynamic allocation of an output and an input 

of the network, that is putting in relation a memory shift 

register. RDMj and a series link LS^^. 

In the case where the block address communication. 

means constitutes the common bus BUSA, the operation is as 

follows: 

In the case of a request for reading of a block of the 
part of the central unit CPUj, the memory management 
processor PGMp concerned allocates a shift register RDM^/ 
controls the interconnection network RI as a consequence, and 
initializes the transfer. 

In the case of a request for writing of a block of the 
part of the central unit CPU^r a path must first be estab- 
lished. To this end, a first request for establishment of 
the path is sent on the common bus BUSA, followed by the writ- 
ing request effecting the transfer of the block from the 
cache memory MCj 'to the shift register RDM^^. At the time 
of the first request, the memory management processor PGM is 
charged with allocating a path and controlling the ihtercon- 
nection network RI. . 
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In the case where the block address cotatnuni cation 
means is the series links themselves, a path must be estab- 
lished for all preliminary transfer. This problem is identi- . 
cal to the classical problem of sharing of an assembly of n 
resources by m users and may be controlled by classical solu- 
tions for arbitration of access conflicts (communication 
protocols, supplementary signals). 

In the example of the architecture shown in FIGURE 5, 
the shift registers RDMj and RDPy their validation logic 
LVl and LV2 were provided with high speed, the assembly being 
synchronized by a clock of a frequency F at least equal to 
100 MBz. 

FIGURE 20b shows a variation of the architecture pro- 
posed in .FIGURE 20a, a solution conforming to the invention 
in which each series link LS^, which connects the processor 
CPU- to all of the mKHory banks, is divided into m series 
links LSjp connecting by point to point the processor 
CPU. to each of the memory banks RAMp. 

This process presents the double advantage following: 

- each link being of the point to point type, may be 
better adapted from the electrical point of view or the fiber 
optic point of view, 

- a level of supplemental parallelism is obtained 
starting from when the treatment processor is in the state of 
anticipating block requests, which is the case actually for 
the for the highest performance processors. 

The interface logic (noted previously TPR^ and 
RDP.) which is associated with the" series link LSj, side 
processor CPUj, is then duplicated into m copies I]_-««Ip- 
...I . One will note the presence of a link for maintain- 
ing the coherence of information, proprietary to each memory 
bank RAMp. The operation of this link is analogous to 
that of the bus SYNCHRO of FIGURE 18. 

There is shown in FIGURES 21a and 21b another struc- 
ture of the memory RAM which comprises 2^ memory planes. 
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each memory plane having a front o£ binary data (for 

reasons of clarity, in FIGURE 212a is shown the means necess- 
ary for the reading of a block bi, and in FIGURE 21b the 
means necessary for writing) • The shift registers RDMj or 
RDPj are constituted of 2^ elementary sub-shift registers 
RDMjp with t/2^ bits of capacity. The example shown in 
FIGURE 20 is a realization of 8 memory planes (u=3> . (For 
clarity of the drawing , there is shown a single shift regist- 
er RDMj formed for' the assembly of sub-^ registers RDM^p) . 
Each memory plane RAM comprises opposite its access front 

an assembly of elementary shift registers RDM. attd is cap- 

u 

able of operating with a shift frequency of at least P/2 . 

The operation of the assembly in case of reading is 
illustrated in FIGURE 21a. A block is read in a synchronous 
manner in the assembly of 2^ memory planes and loaded in 
the same manner in the elementary registers of the same rank. 
The series outputs of these registers are connected to the 
inputs of a multiplexer MUXR made of high speed technology 
(ASGA) . A circuit of this type, perfectly adapted, is avail- 
able at "Gigabit Logic" under the name "LOGO^O.", and is cap- 
able of delivering a logic signal at a frequency of 2.7 GHz. 
It also furnishes a clock frequency divided by eight, which 
constitutes the clock for shift of the elementary registers 
RDM. p. 

In the case of writings a symmetrical operation, shown 
in FIGURE 21b, is obtained with a multiplexer circuit DMUXR 
of the same manufacturer (known as "L0G41")., with the same 
characteristics of performance. 

Thus, a transfer frequency of 500 MHz is obtained with 
8 elementary registers operating at a frequency of 500/8 - 
62.5 MHz, which makes then able to be accomplished with more 
conventional technology ("MOS" for example) . 

The multiplexer and demultiplexer circuits reference 
hereabove are combinable into assemblies of 16, 32,... bits. 
Thus, by associating respectively 16, 32 memory planes 
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operating at 62.5 MHz, it is possible to obtain flow rates of 
1 and 2 GHz, or a level of performance 2 to 4 times greater. 

It should be noted that the logic TPR may. be provided 
on one of the elementary registers, and that the validation 
logic LV is integrated with the circuits "ASGA" (output with 
open collector). 

FIGURE ^ 22 shows the, general structure of a component ' 
of the "VLSI" integrated circuit type, called a multiport 
series memory and capable of equipping a multiprocessor sys- 
tem according to the invention. This component may be used 
in multiprocessor architecture described previously, either 
to provide the central memory RAM and the associated shift 
registers RDMj, or for profiding each cache memory MC^ 
and its shift register RDP^. To simplify the notations, in 
the description which follows, the symbols relative to the 
central memory RAM and the associated shift registers will be 
retained. 

The list of pins of this circuit with the correspond- 
ing signals is as follows: 

- adblocQ-adbloc^^j^ : m address bits of block bi, 

admotQ-admotj^_j^ : Ic bits of word addresses in 

the block, 

numregQ-numreg^_j^ : n bits of register 
addresses rd, 

- cS : "chip select" : circuit selection signal, 

- wr : "write" : signal for writing, 

- rE : "read" : signal for reading, 

bit/bloc ; ^control signal for the multiport 

function, 

- normal/conf ig : signal for the operating mode, 
• - dataQ-data^^^j^ : 1 bits of data, 

- h^-h^ z n clock signals, 

- ^i-d^ : n data signals. 

The values m, n,. 1 are functions of the current state 
of technology. The actual values may be as follows: 
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- m 16 or 2"^° blocks bi of 64 bits each (or 4 

Mbits), 

- n s= 3 or 8 registers rd, 

- 1 =5 8 or a parallel interface of the octet type, 

- k = 3 because of the fact of 8 octets per block. 
The component provided comprises about 50 pin6. 

This series multiport memory circuit is composed of a 
random access read-write memory RAMr of a predetermined size 
t| capable. of being controlled in writing on the fronts inde- 
pendent of the size t/4 (value chosen by way of example in 
FIGURE 22) and t/1 . The data lines of this memory RAM are 
connected to the inputs of a logic of the '•barrel shifter" 
type BS, or of the multiplexing MT^ depending on the version- 
of the component/ the multiplexing logic MT may be considered 
as offering a sub-assembly of possibilities of barrel shifter 
logic and therefor simpler to provide. The address signals 
and the control signals of this memory RAM^ namely csi, wri^ 
rdi/ adbloci r are delivered from a control logic COM, This 
logic CCDM receives further the information signals* from the 
pins cs, wr, rcT^ bit/bloc , normal/conf ig , numreg and is con- 
nected by the control lines "format" to the barrel* shifter 
logic BS/ and to the output of a configuration register RCl/ 
and to the input of a- selection logic LSR providing the sig- 
nals srdp, • . .srdj^^j^ and srcj^, srcj/ src^- The 
outputs, of the barrel shifter type of logic GS constitute an 
internal parallel communication bus BUSI, connected to an 
assembly of shift registers RDq, RD^^^^, on their para- 

llel inputs r for one part, on their parallel inputs, ajid for 
the other part on their parallel outputs through the valida- 
tion buffers BVIOOq,... BVlOO^^^ and to the parallel 
input of . the configuration registers RC^^, RCj... RC^ . 

The 1 bits of low significance of the bus BOSI' are 
also received on the 1 pins, da tag,... data^^.j^. Each 
shift register RD^^ and the associated logic gates consti- 
tute a functional unit SLRO^ 9 controlled by an assembly of 



82 



logic elements which constitute a forcing logic LF^^. Each 
functional unit ELROj^ comprises logic gates BTlOO^ and 
ETIOI. , connected on one of their inputs to the output 
srd. ^of the selection logic LSR, and receiving on their 
other input respectively the signals rd^ and wr^. The 
output of the logic gate ETlOOj^ is connected to the input 
loadlOO. of the shift register RD^, as well as the input 
loadlOl^ and the input S respectively of a counter 
CPTIOO^ and of a flip flop BlOO^^ appurtaining to the forc- 
ing logic LP^. The output of the logic gate ETlOlj^ is 
connected to the control input of the validatiqn buffers 
BVlOOj^. The output di is connected to the output of a 
logic^gate PL^., which receives on its data input the series 
output of the shift register RD. and on its control input 
the output of a logic gate OUIOO.. The signal sent from 
the pin h. is delivered to the input cllclOO. of the regis- 
ter RD.. as well as the input downlOO. of the counter 
CPTIOO^. The output zerolOOj^ of the counter CPTlOO^ is 
connected to the input R of the flip flop BlOOj^. 

The farcing logic LFj_ comprises in addition a multi- 
plexer MUXlOOj^ which receives on its data inputs the values 
t and t/4. The data output of the multiplexer MOXlOOj^ is 
conected to the data input of the counter CPTIOO., and the 
selection command sellOOj^ of the multiplexer MDXlOO^ is 
connected to the output 1 of the register RC^. The output 
Q of the flip flop BlOO^ is' connected to one of the inputs 
of a logic gate ET102j^, which- receives on its other input 
the signal issued from the pin i of a register RCj. 
output of the logic gate ET102j^ is connected to one of the 
inputs of the logic gate OUIOO., which receives on its 
other input the signal issued on the pin i of a register 
RC,. The loading inputs of the registers RCy RC2/ 
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receive respectively the- signals srcj^, srcjr srcj 
issued from the selection logic LSR. 

This component presents a duality of f unctionning : if 
the signal. bit/bI3c is in the "bit" " state, then the 
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functionhing of this component is that of a conventional semi- 
conductor memory: the signals adbloc associated with the sig- 
nals admot constitute the address bus in the word unit' (8 
bits in the example), the signals cs, rcT, wr have the usual 
direction attributed to these signals, and the data pins 
convey these data. 

In an internal manner , in reading , the. block of 
information designated by adbloc is read in the memory RAM 
and presented at the input of the barrel shifter logic BS or 
of the multiplexing MT. The combination of signals admot and 
bit/bloc . permit the control logic COM to provide to the 
barrel shifter logic BS or the multiplexing MT the "format" 
signals. The word concerned is then right justified in the 
output of the barrel shifter logic or the multiplexing and 
presented thus on the data pins data. 

In an internal manner, in writing the word presented 
on the data lines data is aligned by the barrel shifter logic 
LS or the multiplexing MX by the same signals of the control 
format as in reading, with regard to its position in the 
block. The control logic COM emits then a partial writing 
signal wri on the only leg of the memory concerned, and to 
the address designated by the signals adbloc. 

If the signal bit/bloc is "in the "bloc" state, then 
the operation depends on the state of the normal/conf ig sig- 
nal. The config mode programs the configuration registers 
RCj^/ RCjr RC^ addressed by the signals numreg, and pro- 
graxraned from the data lines data* The register RC^ permits 
modifying the size of the block r t and t/4 in the example, 
either 64 bits and 16 bits. In an internal manner, the func- 
tionning is similar to that described in the "bit" embodi- 
ment.: t or t/4 bits are aligned on the internal bus BOSI (in 
reading) , or opposite the leg of the block concerned (in writ- 
ing) • The sizes of the multiple blocks may bei envisioned (t, 
t/2, t/4-. .) • 

The- register RC^ permits selecting for, each register 
a permanent direction of operation: either in the input 
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(RC3^ 0) or in the output (RC2^ =1) . This permanent dir- 
ection permits adapting the component to series links with 
permanent unidirectional connections • The register RC2. 
permits choosing for each register, under reserve that the 
corresponding bit. of RC3 be at the logic state 0, a mode of 
operation with alternate bidirectional links: on a reading 
memory RAM, the shift register RD^^ concerned passes to the 
output mode for the time of transmission of the block r then 
returns to the rest state in the input mode. In an internal 
manner, the flip flop BlOOj^, which controls the logic gate 
PL^, is set to 1 on a loading signal of the register RDMj^ 
and reset to zero upon issue of the transifer of t or t/4 
bits, through the counter CPTlOOj,,' initialized at t or t/4 
according to the state of the register RC^, and which 
receives bn its counting input the clock pulses hi. In nor- 
mal operation ( normal/conf ig signal in the normal state) for 
a reading, the. block addressed by the pins adbloc is loaded 
in the register RD^ addressed by the numreg pins. If the 
block is partial (t/4), then it is transmitted in a position 
of low significance on the internal bus BUSI by the barrel 
shifter type logic BS or the multiplexer MT. This block is 
then transmitted upon activation of the clock signal hi. 

In the normal operation for a writing, the contents of 
the register RD^ addressed by the numreg pins is written in 
the block memory RAM with the address adbloc. If the block 
is partial, it is transmitted into a position of high signifi- 
cance on the internal bus BUSI, then aligned opposite the leg 
of the block concerned by the barrel shifter logic BS or the 
multiplexer MT, and finally a partial writing signal wri is 
sent out on the leg concerned. 

It should be noted that if a partial block is in 
service, ' then the address of this partial block in the block 
is furnished by the address lines admot. 

This component is perfectly adapted to the diverse var- 
iations of the architecture described. Associated in 
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parallel, 8, 16... circuits of this type permitting provision 
of the apparatus described in FIGURES 20a, 20b. If the 
memory RAM is of high speed technology, then this component 
may also be used at the level of the cache memory, while 
multiplexing, according to the device described in FIGURES 
20a, 20b, the internal registers of a same component. 
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